1
|
Lu Z, Chandra NK. A sparse factor model for clustering high-dimensional longitudinal data. Stat Med 2024; 43:3633-3648. [PMID: 38885953 DOI: 10.1002/sim.10151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 04/09/2024] [Accepted: 06/06/2024] [Indexed: 06/20/2024]
Abstract
Recent advances in engineering technologies have enabled the collection of a large number of longitudinal features. This wealth of information presents unique opportunities for researchers to investigate the complex nature of diseases and uncover underlying disease mechanisms. However, analyzing such kind of data can be difficult due to its high dimensionality, heterogeneity and computational challenges. In this article, we propose a Bayesian nonparametric mixture model for clustering high-dimensional mixed-type (eg, continuous, discrete and categorical) longitudinal features. We employ a sparse factor model on the joint distribution of random effects and the key idea is to induce clustering at the latent factor level instead of the original data to escape the curse of dimensionality. The number of clusters is estimated through a Dirichlet process prior. An efficient Gibbs sampler is developed to estimate the posterior distribution of the model parameters. Analysis of real and simulated data is presented and discussed. Our study demonstrates that the proposed model serves as a useful analytical tool for clustering high-dimensional longitudinal data.
Collapse
Affiliation(s)
- Zihang Lu
- Department of Public Health Sciences, Queen's University, Kingston, Ontario, Canada
- Department of Mathematics and Statistics, Queen's University, Kingston, Ontario, Canada
| | - Noirrit Kiran Chandra
- Department of Mathematical Sciences, The University of Texas at Dallas, Richardson, Texas, USA
| |
Collapse
|
2
|
Lu Z, Ahmadiankalati M, Tan Z. Joint clustering multiple longitudinal features: A comparison of methods and software packages with practical guidance. Stat Med 2023; 42:5513-5540. [PMID: 37789706 DOI: 10.1002/sim.9917] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 06/07/2023] [Accepted: 09/13/2023] [Indexed: 10/05/2023]
Abstract
Clustering longitudinal features is a common goal in medical studies to identify distinct disease developmental trajectories. Compared to clustering a single longitudinal feature, integrating multiple longitudinal features allows additional information to be incorporated into the clustering process, which may reveal co-existing longitudinal patterns and generate deeper biological insight. Despite its increasing importance and popularity, there is limited practical guidance for implementing cluster analysis approaches for multiple longitudinal features and evaluating their comparative performance in medical datasets. In this paper, we provide an overview of several commonly used approaches to clustering multiple longitudinal features, with an emphasis on application and implementation through R software. These methods can be broadly categorized into two categories, namely model-based (including frequentist and Bayesian) approaches and algorithm-based approaches. To evaluate their performance, we compare these approaches using real-life and simulated datasets. These results provide practical guidance to applied researchers who are interested in applying these approaches for clustering multiple longitudinal features. Recommendations for applied researchers and suggestions for future research in this area are also discussed.
Collapse
Affiliation(s)
- Zihang Lu
- Department of Public Health Sciences, Queen's University, Kingston, Ontario, Canada
- Department of Mathematics and Statistics, Queen's University, Kingston, Ontario, Canada
| | | | - Zhiwen Tan
- Department of Public Health Sciences, Queen's University, Kingston, Ontario, Canada
| |
Collapse
|
3
|
Lin TI, Wang WL. Flexible modeling of multiple nonlinear longitudinal trajectories with censored and non-ignorable missing outcomes. Stat Methods Med Res 2023; 32:593-608. [PMID: 36624626 DOI: 10.1177/09622802221146312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Multivariate nonlinear mixed-effects models (MNLMMs) have become a promising tool for analyzing multi-outcome longitudinal data following nonlinear trajectory patterns. However, such a classical analysis can be challenging due to censorship induced by detection limits of the quantification assay or non-response occurring when participants missed scheduled visits intermittently or discontinued participation. This article proposes an extension of the MNLMM approach, called the MNLMM-CM, by taking the censored and non-ignorable missing responses into account simultaneously. The non-ignorable missingness is described by the selection-modeling factorization to tackle the missing not at random mechanism. A Monte Carlo expectation conditional maximization algorithm coupled with the first-order Taylor approximation is developed for parameter estimation. The techniques for the calculation of standard errors of fixed effects, estimation of unobservable random effects, imputation of censored and missing responses and prediction of future values are also provided. The proposed methodology is motivated and illustrated by the analysis of a clinical HIV/AIDS dataset with censored RNA viral loads and the presence of missing CD4 and CD8 cell counts. The superiority of our method on the provision of more adequate estimation is validated by a simulation study.
Collapse
Affiliation(s)
- Tsung-I Lin
- Institute of Statistics, 34916National Chung Hsing University, Taichung, Taiwan.,Department of Public Health, China Medical University, Taichung, Taiwan
| | - Wan-Lun Wang
- Department of Statistics and Institute of Data Science, 34912National Cheng Kung University, Tainan, Taiwan
| |
Collapse
|
4
|
Lin TI, Wang WL. Multivariate linear mixed models with censored and nonignorable missing outcomes, with application to AIDS studies. Biom J 2022; 64:1325-1339. [PMID: 35723051 DOI: 10.1002/bimj.202100233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 04/16/2022] [Accepted: 04/22/2022] [Indexed: 11/08/2022]
Abstract
The analysis of multivariate longitudinal data could encounter some complications due to censorship induced by detection limits of the assay and nonresponse occurring when participants missed scheduled visits intermittently or discontinued participation. This paper establishes a generalization of the multivariate linear mixed model that can accommodate censored responses and nonignorable missing outcomes simultaneously. To account for the nonignorable missingness, the selection approach which decomposes the joint distribution as a marginal distribution for the primary outcome variables and a model describing the missing process conditional on the hypothetical complete data is used. A computationally feasible Monte Carlo expectation conditional maximization algorithm is developed for parameter estimation with the maximum likelihood (ML) method. Furthermore, a general information-based approach is presented to assess the variability of ML estimators. The techniques for the prediction of censored responses and imputation of missing outcomes are also discussed. The methodology is motivated and exemplified by a real dataset concerning HIV-AIDS clinical trials. A simulation study is conducted to examine the performance of the proposed method compared with other traditional approaches.
Collapse
Affiliation(s)
- Tsung-I Lin
- Institute of Statistics, National Chung Hsing University, Taichung, Taiwan.,Department of Public Health, China Medical University, Taichung, Taiwan
| | - Wan-Lun Wang
- Department of Statistics and Institute of Data Science, National Cheng Kung University, Tainan, Taiwan
| |
Collapse
|
5
|
Brobbey A, Wiebe S, Nettel-Aguirre A, Josephson CB, Williamson T, Lix LM, Sajobi TT. Repeated measures discriminant analysis using multivariate generalized estimation equations. Stat Methods Med Res 2021; 31:646-657. [PMID: 34898331 PMCID: PMC8961244 DOI: 10.1177/09622802211032705] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Discriminant analysis procedures that assume parsimonious covariance and/or means structures have been proposed for distinguishing between two or more populations in multivariate repeated measures designs. However, these procedures rely on the assumptions of multivariate normality which is not tenable in multivariate repeated measures designs which are characterized by binary, ordinal, or mixed types of response distributions. This study investigates the accuracy of repeated measures discriminant analysis (RMDA) based on the multivariate generalized estimating equations (GEE) framework for classification in multivariate repeated measures designs with the same or different types of responses repeatedly measured over time. Monte Carlo methods were used to compare the accuracy of RMDA procedures based on GEE, and RMDA based on maximum likelihood estimators (MLE) under diverse simulation conditions, which included number of repeated measure occasions, number of responses, sample size, correlation structures, and type of response distribution. RMDA based on GEE exhibited higher average classification accuracy than RMDA based on MLE especially in multivariate non-normal distributions. Three repeatedly measured responses namely severity of epilepsy, current number of anti-epileptic drugs, and parent-reported quality of life in children with epilepsy were used to demonstrate the application of these procedures.
Collapse
Affiliation(s)
- Anita Brobbey
- Department of Community Health Sciences, 2129University of Calgary, University of Calgary, Calgary, Canada
| | - Samuel Wiebe
- Department of Community Health Sciences, 2129University of Calgary, University of Calgary, Calgary, Canada.,Department of Clinical Neurosciences, 2129University of Calgary, University of Calgary, Calgary, Canada
| | - Alberto Nettel-Aguirre
- Centre for Health and Social Analytics, 8691University of Wollongong, National Institute for Applied Statistics Research Australia, University of Wollongong, Wollongong, Australia
| | - Colin Bruce Josephson
- Department of Community Health Sciences, 2129University of Calgary, University of Calgary, Calgary, Canada.,Department of Clinical Neurosciences, 2129University of Calgary, University of Calgary, Calgary, Canada
| | - Tyler Williamson
- Department of Community Health Sciences, 2129University of Calgary, University of Calgary, Calgary, Canada
| | - Lisa M Lix
- Department of Community Health Sciences, University of Manitoba, Winnipeg, Canada
| | - Tolulope T Sajobi
- Department of Community Health Sciences, 2129University of Calgary, University of Calgary, Calgary, Canada.,Department of Clinical Neurosciences, 2129University of Calgary, University of Calgary, Calgary, Canada
| |
Collapse
|
6
|
Taavoni M, Arashi M, Wang WL, Lin TI. Multivariate t semiparametric mixed-effects model for longitudinal data with multiple characteristics. J STAT COMPUT SIM 2020. [DOI: 10.1080/00949655.2020.1812608] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- M. Taavoni
- Department of Statistic, Faculty of Mathematical Sciences, Shahrood University of Technology, Shahrood, Iran
| | - M. Arashi
- Department of Statistic, Faculty of Mathematical Sciences, Shahrood University of Technology, Shahrood, Iran
| | - Wan-Lun Wang
- Department of Statistics, Graduate Institute of Statistics and Actuarial Science, Feng Chia University, Taichung, Taiwan
| | - Tsung-I Lin
- Institute of Statistics, National Chung Hsing University, Taichung, Taiwan
- Department of Public Health, China Medical University, Taichung, Taiwan
| |
Collapse
|
7
|
Lin TI, Wang WL. Multivariate- t linear mixed models with censored responses, intermittent missing values and heavy tails. Stat Methods Med Res 2020; 29:1288-1304. [PMID: 31242813 DOI: 10.1177/0962280219857103] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Multivariate longitudinal data arisen in medical studies often exhibit complex features such as censored responses, intermittent missing values, and atypical or outlying observations. The multivariate-t linear mixed model (MtLMM) has been recognized as a powerful tool for robust modeling of multivariate longitudinal data in the presence of potential outliers or fat-tailed noises. This paper presents a generalization of MtLMM, called the MtLMM-CM, to properly adjust for censorship due to detection limits of the assay and missingness embodied within multiple outcome variables recorded at irregular occasions. An expectation conditional maximization either (ECME) algorithm is developed to compute parameter estimates using the maximum likelihood (ML) approach. The asymptotic standard errors of the ML estimators of fixed effects are obtained by inverting the empirical information matrix according to Louis' method. The techniques for the estimation of random effects and imputation of missing responses are also investigated. The proposed methodology is illustrated on two real-world examples from HIV-AIDS studies and a simulation study under a variety of scenarios.
Collapse
Affiliation(s)
- Tsung-I Lin
- Institute of Statistics, National Chung Hsing University, Taichung, Taiwan.,Department of Public Health, China Medical University, Taichung, Taiwan
| | - Wan-Lun Wang
- Department of Statistics, Graduate Institute of Statistics and Actuarial Science, Feng Chia University, Taichung, Taiwan
| |
Collapse
|
8
|
Li HL, Tai PH, Hwang YT, Lin SW. A five-year longitudinal study of the relation between end-stage kidney disease as the outcomes. BMC Nephrol 2020; 21:132. [PMID: 32295526 PMCID: PMC7161172 DOI: 10.1186/s12882-020-01795-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Accepted: 04/05/2020] [Indexed: 11/17/2022] Open
Abstract
Background Patients with end-stage kidney disease (ESKD) are required to undergo consecutive time-based blood and biochemical tests to determine the progression of the disease according to changes in their blood and biochemical data. This study employed a random intercept model to investigate whether time-based blood and biochemical data present any notable clinical meaning that can be used to track disease progression. Methods This study conducted a retrospective analysis on the dialytic data of 148 patients with ESKD, who received hemodialysis between January 2005 and December 2015. The patients were all at least 20 years old, and the data used included patient demographic information and results for at least 60 blood and biochemical tests. A random intercept model was used to analyze the relationships among blood and biochemical test results, explanatory variables of patient comorbidities, and time. Results The age range of patients was between 33 and 98 years, with an average of 66.1 years and those over 65 years old comprising 51.3% (n = 76) of the total. Furthermore, hypertension was found to be the most common comorbidity among patients (87.2%, n = 129), followed by anemia (48.6%, n = 72), diabetes (47.3%, n = 70), dyslipidemia (19.6%, n = 29), and peptic ulcer (19.6%, n = 29). Coronary atherosclerotic heart disease is a comorbidity that can serve as a strong and independent marker for prognosis in patients with ESKD. Serum creatinine level can serve as an alternative indicator because patients with ESKD and comorbid diabetes may exhibit increased creatinine levels. Conclusions The results of a parameter estimation for longitudinal data analysis suggested that comorbidity and time were critical variables influencing blood and biochemical test results. Furthermore, WBC and HBC, HCT, albumin, protein, and creatinine levels were recognized as variables of critical significance. The results obtained in this study indicate that multimorbidity increases the treatment burden on patients, leading to polypharmacy. For this reason, comprehensive care and treatment of ESKD cannot rely solely on data from one single time point; instead, longitudinal analysis and other data that can affect patient prognosis must also be considered.
Collapse
Affiliation(s)
- Hsiu-Lan Li
- Graduate Institute of Business and Management, Chang Gung University, Taoyuan City, Taiwan
| | - Pei-Hui Tai
- Department of Nursing, En Cku Kong Hospital, New Taipei City, Taiwan
| | - Yi-Ting Hwang
- Department of Statistics, National Taipei University, New Taipei City, Taiwan
| | - Shih-Wei Lin
- Department of Information Management, Chang Gung University, 259 Wen-Hwa 1st Road, Kwei-Shan, Tao-Yuan, 333, Taiwan. .,Department of Neurology, Linkou Chang Gung Memorial Hospital, Taoyuan City, Taiwan. .,Department of Industrial Engineering and Management, Ming Chi University of Technology, New Taipei City, Taiwan.
| |
Collapse
|
9
|
Mixture of multivariate t nonlinear mixed models for multiple longitudinal data with heterogeneity and missing values. TEST-SPAIN 2018. [DOI: 10.1007/s11749-018-0612-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
10
|
Rajeswaran J, Blackstone EH, Barnard J. Evolution of association between renal and liver functions while awaiting heart transplant: An application using a bivariate multiphase nonlinear mixed effects model. Stat Methods Med Res 2018; 27:2216-2230. [PMID: 27856959 PMCID: PMC5433933 DOI: 10.1177/0962280216678022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In many longitudinal follow-up studies, we observe more than one longitudinal outcome. Impaired renal and liver functions are indicators of poor clinical outcomes for patients who are on mechanical circulatory support and awaiting heart transplant. Hence, monitoring organ functions while waiting for heart transplant is an integral part of patient management. Longitudinal measurements of bilirubin can be used as a marker for liver function and glomerular filtration rate for renal function. We derive an approximation to evolution of association between these two organ functions using a bivariate nonlinear mixed effects model for continuous longitudinal measurements, where the two submodels are linked by a common distribution of time-dependent latent variables and a common distribution of measurement errors.
Collapse
Affiliation(s)
- Jeevanantham Rajeswaran
- Department of Quantitative Health Sciences, and Heart and Vascular Institute, Cleveland Clinic, Cleveland, USA
| | - Eugene H Blackstone
- Department of Quantitative Health Sciences, and Heart and Vascular Institute, Cleveland Clinic, Cleveland, USA
| | - John Barnard
- Department of Quantitative Health Sciences, and Heart and Vascular Institute, Cleveland Clinic, Cleveland, USA
| |
Collapse
|
11
|
Lin TI, Lachos VH, Wang WL. Multivariate longitudinal data analysis with censored and intermittent missing responses. Stat Med 2018; 37:2822-2835. [PMID: 29740829 DOI: 10.1002/sim.7692] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Revised: 03/31/2018] [Accepted: 04/02/2018] [Indexed: 11/08/2022]
Abstract
The multivariate linear mixed model (MLMM) has emerged as an important analytical tool for longitudinal data with multiple outcomes. However, the analysis of multivariate longitudinal data could be complicated by the presence of censored measurements because of a detection limit of the assay in combination with unavoidable missing values arising when subjects miss some of their scheduled visits intermittently. This paper presents a generalization of the MLMM approach, called the MLMM-CM, for a joint analysis of the multivariate longitudinal data with censored and intermittent missing responses. A computationally feasible expectation maximization-based procedure is developed to carry out maximum likelihood estimation within the MLMM-CM framework. Moreover, the asymptotic standard errors of fixed effects are explicitly obtained via the information-based method. We illustrate our methodology by using simulated data and a case study from an AIDS clinical trial. Experimental results reveal that the proposed method is able to provide more satisfactory performance as compared with the traditional MLMM approach.
Collapse
Affiliation(s)
- Tsung-I Lin
- Institute of Statistics, National Chung Hsing University, Taichung 402, Taiwan
- Department of Public Health, China Medical University, Taichung 404, Taiwan
| | - Victor H Lachos
- Department of Statistics, University of Connecticut, Storrs, CT 06269, USA
| | - Wan-Lun Wang
- Department of Statistics, Graduate Institute of Statistics and Actuarial Science, Feng Chia University, Taichung 40724, Taiwan
| |
Collapse
|
12
|
Wagner BD, Kroehl M, Gan R, Mikulich-Gilbertson SK, Sagel SD, Riggs PD, Brown T, Snell-Bergeon J, Zerbe GO. A Multivariate Generalized Linear Model Approach to Mediation Analysis and Application of Confidence Ellipses. STATISTICS IN BIOSCIENCES 2017. [DOI: 10.1007/s12561-017-9191-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
13
|
Luwanda AG, Mwambi HG. A nonlinear mixed-effects model for multivariate longitudinal data with partially observed outcomes with application to HIV disease dynamics. J Appl Stat 2017. [DOI: 10.1080/02664763.2016.1177494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
14
|
Milla J, Martín ES, Van Bellegem S. Higher Education Value Added Using Multiple Outcomes. JOURNAL OF EDUCATIONAL MEASUREMENT 2016. [DOI: 10.1111/jedm.12114] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
15
|
de la Cruz R, Fuentes C, Meza C, Núñez-Antón V. Error-rate estimation in discriminant analysis of non-linear longitudinal data: A comparison of resampling methods. Stat Methods Med Res 2016; 27:1153-1167. [DOI: 10.1177/0962280216656246] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Consider longitudinal observations across different subjects such that the underlying distribution is determined by a non-linear mixed-effects model. In this context, we look at the misclassification error rate for allocating future subjects using cross-validation, bootstrap algorithms (parametric bootstrap, leave-one-out, .632 and [Formula: see text]), and bootstrap cross-validation (which combines the first two approaches), and conduct a numerical study to compare the performance of the different methods. The simulation and comparisons in this study are motivated by real observations from a pregnancy study in which one of the main objectives is to predict normal versus abnormal pregnancy outcomes based on information gathered at early stages. Since in this type of studies it is not uncommon to have insufficient data to simultaneously solve the classification problem and estimate the misclassification error rate, we put special attention to situations when only a small sample size is available. We discuss how the misclassification error rate estimates may be affected by the sample size in terms of variability and bias, and examine conditions under which the misclassification error rate estimates perform reasonably well.
Collapse
Affiliation(s)
- Rolando de la Cruz
- Institute of Statistics, Pontificia Universidad Católica de Valparaíso, Chile
| | - Claudio Fuentes
- Department of Statistics, Oregon State University, Corvallis, OR, USA
| | - Cristian Meza
- CIMFAV, Facultad de Ingeniería, Universidad de Valparaíso, Valparaíso, Chile
| | - Vicente Núñez-Antón
- Department of Econometrics and Statistics (A.E.III), University of the Basque Country UPV/EHU, Bilbao, Spain
| |
Collapse
|
16
|
Approximate Methods for Maximum Likelihood Estimation of Multivariate Nonlinear Mixed-Effects Models. ENTROPY 2015. [DOI: 10.3390/e17085353] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
17
|
Mikulich-Gilbertson SK, Wagner BD, Riggs PD, Zerbe GO. On estimating and testing associations between random coefficients from multivariate generalized linear mixed models of longitudinal outcomes. Stat Methods Med Res 2015; 26:1130-1145. [PMID: 25636408 DOI: 10.1177/0962280214568522] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Different types of outcomes (e.g. binary, count, continuous) can be simultaneously modeled with multivariate generalized linear mixed models by assuming: (1) same or different link functions, (2) same or different conditional distributions, and (3) conditional independence given random subject effects. Others have used this approach for determining simple associations between subject-specific parameters (e.g. correlations between slopes). We demonstrate how more complex associations (e.g. partial regression coefficients between slopes adjusting for intercepts, time lags of maximum correlation) can be estimated. Reparameterizing the model to directly estimate coefficients allows us to compare standard errors based on the inverse of the Hessian matrix with more usual standard errors approximated by the delta method; a mathematical proof demonstrates their equivalence when the gradient vector approaches zero. Reparameterization also allows us to evaluate significance of coefficients with likelihood ratio tests and to compare this approach with more usual Wald-type t-tests and Fisher's z transformations. Simulations indicate that the delta method and inverse Hessian standard errors are nearly equivalent and consistently overestimate the true standard error. Only the likelihood ratio test based on the reparameterized model has an acceptable type I error rate and is therefore recommended for testing associations between stochastic parameters. Online supplementary materials include our medical data example, annotated code, and simulation details.
Collapse
Affiliation(s)
- Susan K Mikulich-Gilbertson
- 1 Department of Psychiatry, School of Medicine, University of Colorado Anschutz Medical Center, Aurora, USA.,2 Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Center, Aurora, USA
| | - Brandie D Wagner
- 2 Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Center, Aurora, USA
| | - Paula D Riggs
- 1 Department of Psychiatry, School of Medicine, University of Colorado Anschutz Medical Center, Aurora, USA
| | - Gary O Zerbe
- 2 Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Center, Aurora, USA
| |
Collapse
|
18
|
Arribas-Gil A, De la Cruz R, Lebarbier E, Meza C. Classification of longitudinal data through a semiparametric mixed-effects model based on lasso-type estimators. Biometrics 2015; 71:333-43. [DOI: 10.1111/biom.12280] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Revised: 10/01/2014] [Accepted: 11/01/2014] [Indexed: 11/28/2022]
Affiliation(s)
- Ana Arribas-Gil
- Departamento de Estadística; Universidad Carlos III de Madrid; Getafe Spain
| | - Rolando De la Cruz
- Advanced Center for Chronic Diseases (ACCDiS) and Department of Public Health, School of Medicine; and Department of Statistics, Faculty of Mathematics; Pontificia Universidad Católica de Chile; Santiago Chile
| | | | - Cristian Meza
- CIMFAV-Facultad de Ingeniería; Universidad de Valparaíso; Valparaíso Chile
| |
Collapse
|
19
|
Wang WL, Lin TI. Bayesian analysis of multivariatetlinear mixed models with missing responses at random. J STAT COMPUT SIM 2014. [DOI: 10.1080/00949655.2014.989852] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
|
20
|
Wang WL, Lin TI. Multivariate t
nonlinear mixed-effects models for multi-outcome longitudinal data with missing values. Stat Med 2014; 33:3029-46. [DOI: 10.1002/sim.6144] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Revised: 01/27/2014] [Accepted: 02/15/2014] [Indexed: 11/09/2022]
Affiliation(s)
- Wan-Lun Wang
- Department of Statistics; Graduate Institute of Statistics and Actuarial Science, Feng Chia University; Taichung 40724 Taiwan
| | - Tsung-I Lin
- Department of Applied Mathematics; Institute of Statistics, National Chung Hsing University; Taichung 402 Taiwan
- Department of Public Health; China Medical University; Taichung 404 Taiwan
| |
Collapse
|
21
|
Kuramoto L, Cragg J, Nandhagopal R, Mak E, Sossi V, de la Fuente-Fernández R, Stoessl AJ, Schulzer M. The nature of progression in Parkinson's disease: an application of non-linear, multivariate, longitudinal random effects modelling. PLoS One 2013; 8:e76595. [PMID: 24204641 PMCID: PMC3799835 DOI: 10.1371/journal.pone.0076595] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Accepted: 09/02/2013] [Indexed: 11/18/2022] Open
Abstract
Background To date, statistical methods that take into account fully the non-linear, longitudinal and multivariate aspects of clinical data have not been applied to the study of progression in Parkinson’s disease (PD). In this paper, we demonstrate the usefulness of such methodology for studying the temporal and spatial aspects of the progression of PD. Extending this methodology further, we also explore the presymptomatic course of this disease. Methods Longitudinal Positron Emission Tomography (PET) measurements were collected on 78 PD patients, from 4 subregions on each side of the brain, using 3 different radiotracers. Non-linear, multivariate, longitudinal random effects modelling was applied to analyze and interpret these data. Results The data showed a non-linear decline in PET measurements, which we modelled successfully by an exponential function depending on two patient-related covariates duration since symptom onset and age at symptom onset. We found that the degree of damage was significantly greater in the posterior putamen than in the anterior putamen throughout the disease. We also found that over the course of the illness, the difference between the less affected and more affected sides of the brain decreased in the anterior putamen. Younger patients had significantly poorer measurements than older patients at the time of symptom onset suggesting more effective compensatory mechanisms delaying the onset of symptoms. Cautious extrapolation showed that disease onset had occurred some 8 to 17 years prior to symptom onset. Conclusions Our model provides important biological insights into the pathogenesis of PD, as well as its preclinical aspects. Our methodology can be applied widely to study many other chronic progressive diseases.
Collapse
Affiliation(s)
- Lisa Kuramoto
- Centre for Clinical Epidemiology & Evaluation, Vancouver Coastal Health Research Institute, Vancouver, British Columbia, Canada
- * E-mail:
| | - Jacquelyn Cragg
- Pacific Parkinson’s Research Centre, The University of British Columbia, Vancouver, British Columbia, Canada
| | - Ramachandiran Nandhagopal
- Pacific Parkinson’s Research Centre, The University of British Columbia, Vancouver, British Columbia, Canada
| | - Edwin Mak
- Pacific Parkinson’s Research Centre, The University of British Columbia, Vancouver, British Columbia, Canada
| | - Vesna Sossi
- Department of Physics & Astronomy, The University of British Columbia, Vancouver, British Columbia, Canada
| | - Raul de la Fuente-Fernández
- Pacific Parkinson’s Research Centre, The University of British Columbia, Vancouver, British Columbia, Canada
| | - A. Jon Stoessl
- Pacific Parkinson’s Research Centre, The University of British Columbia, Vancouver, British Columbia, Canada
| | - Michael Schulzer
- Centre for Clinical Epidemiology & Evaluation, Vancouver Coastal Health Research Institute, Vancouver, British Columbia, Canada
- Pacific Parkinson’s Research Centre, The University of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
22
|
Wang WL. Multivariate t linear mixed models for irregularly observed multiple repeated measures with missing outcomes. Biom J 2013; 55:554-71. [PMID: 23740830 DOI: 10.1002/bimj.201200001] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2012] [Revised: 03/27/2013] [Accepted: 03/30/2013] [Indexed: 11/08/2022]
Abstract
Missing outcomes or irregularly timed multivariate longitudinal data frequently occur in clinical trials or biomedical studies. The multivariate t linear mixed model (MtLMM) has been shown to be a robust approach to modeling multioutcome continuous repeated measures in the presence of outliers or heavy-tailed noises. This paper presents a framework for fitting the MtLMM with an arbitrary missing data pattern embodied within multiple outcome variables recorded at irregular occasions. To address the serial correlation among the within-subject errors, a damped exponential correlation structure is considered in the model. Under the missing at random mechanism, an efficient alternating expectation-conditional maximization (AECM) algorithm is used to carry out estimation of parameters and imputation of missing values. The techniques for the estimation of random effects and the prediction of future responses are also investigated. Applications to an HIV-AIDS study and a pregnancy study involving analysis of multivariate longitudinal data with missing outcomes as well as a simulation study have highlighted the superiority of MtLMMs on the provision of more adequate estimation, imputation and prediction performances.
Collapse
Affiliation(s)
- Wan-Lun Wang
- Department of Statistics, Graduate Institute of Statistics and Actuarial Science, Feng Chia University, Taichung 40724, Taiwan.
| |
Collapse
|
23
|
Nandhagopal R, Kuramoto L, Schulzer M, Mak E, Cragg J, McKenzie J, McCormick S, Ruth TJ, Sossi V, de la Fuente-Fernandez R, Stoessl AJ. Longitudinal evolution of compensatory changes in striatal dopamine processing in Parkinson's disease. ACTA ACUST UNITED AC 2012; 134:3290-8. [PMID: 22075521 DOI: 10.1093/brain/awr233] [Citation(s) in RCA: 121] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Parkinson's disease is a relentlessly progressive neurodegenerative disease. Breakdown of compensatory mechanisms influencing putaminal dopamine processing could contribute to the progressive motor symptoms. We studied a cohort of 78 subjects (at baseline) with sporadic Parkinson's disease and 35 healthy controls with multi-tracer positron emission tomography scans to investigate the evolution of adaptive mechanisms influencing striatal dopamine processing in Parkinson's disease progression. Presynaptic dopaminergic integrity was assessed with three radioligands: (i) [(11)C](±)dihydrotetrabenazine, to estimate the density of vesicular monoamine transporter type 2; (ii) [(11)C]d-threo-methylphenidate, to label the dopamine transporter; and (iii) 6-[(18)F]fluoro-L-DOPA, to assess the activity of aromatic amino acid decarboxylase and storage of 6-[(18)F]-fluorodopamine in synaptic vesicles. The subjects with Parkinson's disease and the healthy controls underwent positron emission tomography scans at the initial visit and after 4 and 8 years of follow-up. Non-linear multivariate regression analyses with random effects were utilized to model the longitudinal changes in tracer values in the putamen standardized relative to normal controls. We found evidence for possible upregulation of dopamine synthesis and downregulation of dopamine transporter in the more severely affected putamen in the early stage of Parkinson's disease. The standardized 6-[(18)F]fluoro-L-DOPA and [(11)C]d-threo-methylphenidate values tended to approach [(11)C](±)dihydrotetrabenazine values in the putamen in later stages of disease (i.e. for [(11)C](±)dihydrotetrabenazine values <25% of normal), when the rates of decline in the positron emission tomography measurements were similar for all the markers. Our data suggest that compensatory mechanisms decline as Parkinson's disease progresses. This breakdown of compensatory strategies in the putamen could contribute to the progression of motor symptoms in advanced disease.
Collapse
Affiliation(s)
- Ramachandiran Nandhagopal
- Pacific Parkinson's Research Centre, University of British Columbia and Vancouver Coastal Health, Vancouver, BC V6T 2B5, Canada
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
de la Fuente-Fernández R, Schulzer M, Kuramoto L, Cragg J, Ramachandiran N, Au WL, Mak E, McKenzie J, McCormick S, Sossi V, Ruth TJ, Lee CS, Calne DB, Stoessl AJ. Age-specific progression of nigrostriatal dysfunction in Parkinson's disease. Ann Neurol 2011; 69:803-10. [PMID: 21246604 DOI: 10.1002/ana.22284] [Citation(s) in RCA: 179] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2010] [Revised: 08/18/2010] [Accepted: 09/24/2010] [Indexed: 11/06/2022]
Abstract
OBJECTIVE To investigate in vivo the impact of age on nigrostriatal dopamine dysfunction in Parkinson's disease (PD). METHODS PD patients (n = 78) and healthy control subjects (n = 35) underwent longitudinal positron emission tomography assessments using 3 presynaptic dopamine markers: (1) [¹¹C](±)dihydrotetrabenazine (DTBZ), to estimate the density of the vesicular monoamine transporter type 2; (2) [¹¹C]d-threo-methylphenidate, to estimate the density of the plasma membrane dopamine transporter; and (3) 6-[¹⁸F]-fluoro-L-dopa, to estimate the activity of the enzyme dopa-decarboxylase. RESULTS The study comprised 438 PD scans and 241 control scans (679 scans in total). At symptom onset, the loss of putamen DTBZ binding was substantially greater in younger compared to older PD patients (p = 0.015). Remarkably, however, the rate of progression of DTBZ binding loss was significantly slower in younger patients (p < 0.05). The estimated presymptomatic phase of the disease spanned more than 2 decades in younger patients, compared to 1 decade in older patients. INTERPRETATION Our results suggest that, compared to older patients, younger PD patients progress more slowly and are able to endure more damage to the dopaminergic system before the first motor symptoms appear. These observations suggest that younger PD patients have more efficient compensatory mechanisms.
Collapse
|
25
|
Nandhagopal R, Kuramoto L, Schulzer M, Mak E, Cragg J, Lee CS, McKenzie J, McCormick S, Samii A, Troiano A, Ruth TJ, Sossi V, de la Fuente-Fernandez R, Calne DB, Stoessl AJ. Longitudinal progression of sporadic Parkinson's disease: a multi-tracer positron emission tomography study. Brain 2009; 132:2970-9. [DOI: 10.1093/brain/awp209] [Citation(s) in RCA: 190] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
26
|
Ghisletta P. Application of a joint multivariate longitudinal-survival analysis to examine the terminal decline hypothesis in the Swiss Interdisciplinary Longitudinal Study on the Oldest Old. J Gerontol B Psychol Sci Soc Sci 2008; 63:P185-92. [PMID: 18559684 DOI: 10.1093/geronb/63.3.p185] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In this work I aim at extending current knowledge on the terminal decline hypothesis by applying a joint multivariate longitudinal-survival analysis to the cognitive data of the Swiss Interdisciplinary Longitudinal Study on the Oldest Old. (In that study, 529 individuals between 79 and 85 years of age at study inception were assessed up to five times on a task of perceptual speed and one of verbal fluency.) I simultaneously estimated a multivariate, multilevel longitudinal model and a Weibull survival model to test whether individual performance and change in speed and fluency predict survival, controlling for retest effects, initial age, gender, overall health, socioeconomic status, and sensory functioning. Results revealed that age and performance level in fluency predicted survival, whereas level in speed and change in both cognitive variables did not. I discuss the relevance of fluency tasks in predicting mortality.
Collapse
Affiliation(s)
- Paolo Ghisletta
- Faculty of Psychology and Educational Sciences, University of Geneva, Boulevard du Pont d'Arve 40, 1211 Genève 4, Switzerland.
| |
Collapse
|
27
|
Marshall G, De la Cruz-Mesía R, Quintana FA, Barón AE. Discriminant analysis for longitudinal data with multiple continuous responses and possibly missing data. Biometrics 2008; 65:69-80. [PMID: 18363774 DOI: 10.1111/j.1541-0420.2008.01016.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Multiple outcomes are often used to properly characterize an effect of interest. This article discusses model-based statistical methods for the classification of units into one of two or more groups where, for each unit, repeated measurements over time are obtained on each outcome. We relate the observed outcomes using multivariate nonlinear mixed-effects models to describe evolutions in different groups. Due to its flexibility, the random-effects approach for the joint modeling of multiple outcomes can be used to estimate population parameters for a discriminant model that classifies units into distinct predefined groups or populations. Parameter estimation is done via the expectation-maximization algorithm with a linear approximation step. We conduct a simulation study that sheds light on the effect that the linear approximation has on classification results. We present an example using data from a study in 161 pregnant women in Santiago, Chile, where the main interest is to predict normal versus abnormal pregnancy outcomes.
Collapse
Affiliation(s)
- Guillermo Marshall
- Departamento de Estadística, Facultad de Matemáticas, Pontificia Universidad Católica de Chile, Casilla 306, Correo 22, Santiago, Chile
| | | | | | | |
Collapse
|