1
|
Rustand D, van Niekerk J, Krainski ET, Rue H, Proust-Lima C. Fast and flexible inference for joint models of multivariate longitudinal and survival data using integrated nested Laplace approximations. Biostatistics 2024; 25:429-448. [PMID: 37531620 PMCID: PMC11017128 DOI: 10.1093/biostatistics/kxad019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 07/12/2023] [Accepted: 07/13/2023] [Indexed: 08/04/2023] Open
Abstract
Modeling longitudinal and survival data jointly offers many advantages such as addressing measurement error and missing data in the longitudinal processes, understanding and quantifying the association between the longitudinal markers and the survival events, and predicting the risk of events based on the longitudinal markers. A joint model involves multiple submodels (one for each longitudinal/survival outcome) usually linked together through correlated or shared random effects. Their estimation is computationally expensive (particularly due to a multidimensional integration of the likelihood over the random effects distribution) so that inference methods become rapidly intractable, and restricts applications of joint models to a small number of longitudinal markers and/or random effects. We introduce a Bayesian approximation based on the integrated nested Laplace approximation algorithm implemented in the R package R-INLA to alleviate the computational burden and allow the estimation of multivariate joint models with fewer restrictions. Our simulation studies show that R-INLA substantially reduces the computation time and the variability of the parameter estimates compared with alternative estimation strategies. We further apply the methodology to analyze five longitudinal markers (3 continuous, 1 count, 1 binary, and 16 random effects) and competing risks of death and transplantation in a clinical trial on primary biliary cholangitis. R-INLA provides a fast and reliable inference technique for applying joint models to the complex multivariate data encountered in health research.
Collapse
Affiliation(s)
- Denis Rustand
- Statistics Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Janet van Niekerk
- Statistics Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Elias Teixeira Krainski
- Statistics Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Håvard Rue
- Statistics Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Cécile Proust-Lima
- Bordeaux Population Health Center, Inserm, UMR1219, Univ. Bordeaux, F-33000 Bordeaux, France
| |
Collapse
|
2
|
Wang C, Shen J, Charalambous C, Pan J. Modeling biomarker variability in joint analysis of longitudinal and time-to-event data. Biostatistics 2024; 25:577-596. [PMID: 37230468 PMCID: PMC11017116 DOI: 10.1093/biostatistics/kxad009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Revised: 04/18/2023] [Accepted: 04/21/2023] [Indexed: 05/27/2023] Open
Abstract
The role of visit-to-visit variability of a biomarker in predicting related disease has been recognized in medical science. Existing measures of biological variability are criticized for being entangled with random variability resulted from measurement error or being unreliable due to limited measurements per individual. In this article, we propose a new measure to quantify the biological variability of a biomarker by evaluating the fluctuation of each individual-specific trajectory behind longitudinal measurements. Given a mixed-effects model for longitudinal data with the mean function over time specified by cubic splines, our proposed variability measure can be mathematically expressed as a quadratic form of random effects. A Cox model is assumed for time-to-event data by incorporating the defined variability as well as the current level of the underlying longitudinal trajectory as covariates, which, together with the longitudinal model, constitutes the joint modeling framework in this article. Asymptotic properties of maximum likelihood estimators are established for the present joint model. Estimation is implemented via an Expectation-Maximization (EM) algorithm with fully exponential Laplace approximation used in E-step to reduce the computation burden due to the increase of the random effects dimension. Simulation studies are conducted to reveal the advantage of the proposed method over the two-stage method, as well as a simpler joint modeling approach which does not take into account biomarker variability. Finally, we apply our model to investigate the effect of systolic blood pressure variability on cardiovascular events in the Medical Research Council elderly trial, which is also the motivating example for this article.
Collapse
Affiliation(s)
- Chunyu Wang
- Department of Mathematics, The University of Manchester, Manchester, M13 9PL, UK
- MRC Biostatistics Unit, University of Cambridge, Cambridge, CB2 0SR, UK
| | - Jiaming Shen
- Department of Mathematics, The University of Manchester, Manchester, M13 9PL, UK
| | | | - Jianxin Pan
- Research Center for Mathematics, Beijing Normal University, Zhuhai, China
- Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, China
| |
Collapse
|
3
|
Devaux A, Helmer C, Genuer R, Proust-Lima C. Random survival forests with multivariate longitudinal endogenous covariates. Stat Methods Med Res 2023; 32:2331-2346. [PMID: 37886845 DOI: 10.1177/09622802231206477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2023]
Abstract
Predicting the individual risk of clinical events using the complete patient history is a major challenge in personalized medicine. Analytical methods have to account for a possibly large number of time-dependent predictors, which are often characterized by irregular and error-prone measurements, and are truncated early by the event. In this work, we extended the competing-risk random survival forests to handle such endogenous longitudinal predictors when predicting event probabilities. The method, implemented in the R package DynForest, internally transforms the time-dependent predictors at each node of each tree into time-fixed features (using mixed models) that can then be used as splitting candidates. The final individual event probability is computed as the average of leaf-specific Aalen-Johansen estimators over the trees. Using simulations, we compared the performances of DynForest to accurately predict an event with (i) a joint modeling alternative when considering two longitudinal predictors only, and with (ii) a regression calibration method that ignores the informative truncation by the event when dealing with a large number of longitudinal predictors. Through an application in dementia research, we also illustrated how DynForest can be used to develop a dynamic prediction tool for dementia from multimodal repeated markers, and quantify the importance of each marker.
Collapse
Affiliation(s)
- Anthony Devaux
- Univ. Bordeaux, INSERM, BPH, U1219, Bordeaux, France
- The George Institute for Global Health, UNSW Sydney, Australia
- School of Population Health, UNSW Sydney, Australia
| | | | - Robin Genuer
- Univ. Bordeaux, INSERM, INRIA, BPH, U1219, Bordeaux, France
| | | |
Collapse
|
4
|
Brossard M, Paterson AD, Espin-Garcia O, Craiu RV, Bull SB. Characterization of direct and/or indirect genetic associations for multiple traits in longitudinal studies of disease progression. Genetics 2023; 225:iyad119. [PMID: 37369448 DOI: 10.1093/genetics/iyad119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 06/07/2023] [Accepted: 06/19/2023] [Indexed: 06/29/2023] Open
Abstract
When quantitative longitudinal traits are risk factors for disease progression and subject to random biological variation, joint model analysis of time-to-event and longitudinal traits can effectively identify direct and/or indirect genetic association of single nucleotide polymorphisms (SNPs) with time-to-event. We present a joint model that integrates: (1) a multivariate linear mixed model describing trajectories of multiple longitudinal traits as a function of time, SNP effects, and subject-specific random effects and (2) a frailty Cox survival model that depends on SNPs, longitudinal trajectory effects, and subject-specific frailty accounting for dependence among multiple time-to-event traits. Motivated by complex genetic architecture of type 1 diabetes complications (T1DC) observed in the Diabetes Control and Complications Trial (DCCT), we implement a 2-stage approach to inference with bootstrap joint covariance estimation and develop a hypothesis testing procedure to classify direct and/or indirect SNP association with each time-to-event trait. By realistic simulation study, we show that joint modeling of 2 time-to-T1DC (retinopathy and nephropathy) and 2 longitudinal risk factors (HbA1c and systolic blood pressure) reduces estimation bias in genetic effects and improves classification accuracy of direct and/or indirect SNP associations, compared to methods that ignore within-subject risk factor variability and dependence among longitudinal and time-to-event traits. Through DCCT data analysis, we demonstrate feasibility for candidate SNP modeling and quantify effects of sample size and Winner's curse bias on classification for 2 SNPs identified as having indirect associations with time-to-T1DC traits. Joint analysis of multiple longitudinal and multiple time-to-event traits provides insight into complex traits architecture.
Collapse
Affiliation(s)
- Myriam Brossard
- Prosserman Centre for Population Health Research, Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto M5T 3L9, Ontario, Canada
| | - Andrew D Paterson
- Program in Genetics and Genome Biology, Hospital for Sick Children Research Institute, Toronto M5G 1X8, Ontario, Canada
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto M5T 3M7, Ontario, Canada
| | - Osvaldo Espin-Garcia
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto M5T 3M7, Ontario, Canada
- Department of Biostatistics, Princess Margaret Cancer Centre, Toronto M5G 2C1, Ontario, Canada
- Department of Statistical Sciences, University of Toronto, Toronto M5S 3G3, Ontario, Canada
- Department of Epidemiology and Biostatistics, Western University, London N6A 5C1, Ontario, Canada
| | - Radu V Craiu
- Department of Statistical Sciences, University of Toronto, Toronto M5S 3G3, Ontario, Canada
| | - Shelley B Bull
- Prosserman Centre for Population Health Research, Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto M5T 3L9, Ontario, Canada
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto M5T 3M7, Ontario, Canada
| |
Collapse
|
5
|
Liu D, Wu E, Shih JH, Kitahara CM, Cheung LC. Absolute and relative risk estimation in the presence of outcome ascertainment gaps and competing risks. Stat Med 2023; 42:1263-1276. [PMID: 36705055 DOI: 10.1002/sim.9668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 08/23/2022] [Accepted: 10/13/2022] [Indexed: 01/28/2023]
Abstract
Incomplete coverage by cancer registries can lead to an underreporting of cancers and a resulting bias in risk estimates. When registries are defined by geographic region, gaps in observation can arise for individuals who reside outside of or migrate from the total registry catchment area. Moreover, the exact periods of non-observation for an individual may be unknown due to intermittent reporting of residential histories. The motivating example for this work is the U.S. Radiologic Technologist (USRT) study which ascertained cancer outcomes for a national cohort through 43 state/regional registries; similar gaps in outcome ascertainment can appear in other registry or electronic health record- based cohort studies. We propose a two-step procedure for estimating relative and absolute risk in these settings. First, using a mover stayer model fitted to individuals' known residential history, we obtain individual posterior probabilities of residing outside the registry catchment area each year. Second, we incorporate these probabilities in the survival data likelihood for competing risks to account for unobserved events. We assess the performance of the proposed method in extensive simulation studies. Compared to several simple alternative approaches, the proposed method reduces bias and improves efficiency. Finally, we apply the proposed method to a study of first primary lung cancers in the USRT cohort.
Collapse
Affiliation(s)
- Danping Liu
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Emily Wu
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Joanna H Shih
- Biometric Research Branch, Division of Cancer Treatment and Diagnosis, National Cancer Institute, Rockville, Maryland, USA
| | - Cari M Kitahara
- Radiation Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Li C Cheung
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| |
Collapse
|
6
|
Longitudinal uric acid has nonlinear association with kidney failure and mortality in chronic kidney disease. Sci Rep 2023; 13:3952. [PMID: 36894586 PMCID: PMC9998636 DOI: 10.1038/s41598-023-30902-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Accepted: 03/03/2023] [Indexed: 03/11/2023] Open
Abstract
We investigated the shape of the relationship between longitudinal uric acid (UA) and the hazard of kidney failure and death in chronic kidney disease (CKD) patients, and attempted to identify thresholds associated with increased hazards. We included CKD stage 3-5 patients from the CKD-REIN cohort with one serum UA measurement at cohort entry. We used cause-specific multivariate Cox models including a spline function of current values of UA (cUA), estimated from a separate linear mixed model. We followed 2781 patients (66% men, median age, 69 years) for a median of 3.2 years with a median of five longitudinal UA measures per patient. The hazard of kidney failure increased with increasing cUA, with a plateau between 6 and 10 mg/dl and a sharp increase above 11 mg/dl. The hazard of death had a U-shape relationship with cUA, with a hazard twice higher for 3 or 11 mg/dl, compared to 5 mg/dl. In CKD patients, our results indicate that UA above 10 mg/dl is a strong risk marker for kidney failure and death and that low UA levels below 5 mg/dl are associated with death before kidney failure.
Collapse
|
7
|
Dong J(J, Shi H, Wang L, Zhang Y, Cao J. Jointly modelling multiple transplant outcomes by a competing risk model via functional principal component analysis. J Appl Stat 2023; 50:43-59. [DOI: 10.1080/02664763.2021.1981256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Jianghu (James) Dong
- Department of Biostatistics, College of Public Health, University of Nebraska Medical Center, Omaha, Nebraska, USA
- Division of Nephrology, Department of Medicine, University of Nebraska Medical Center, Omaha, Nebraska, USA
| | - Haolun Shi
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, Canada
| | - Liangliang Wang
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, Canada
| | - Ying Zhang
- Department of Biostatistics, College of Public Health, University of Nebraska Medical Center, Omaha, Nebraska, USA
| | - Jiguo Cao
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, Canada
| |
Collapse
|
8
|
Mchunu NN, Mwambi HG, Rizopoulos D, Reddy T, Yende-Zuma N. Using joint models to study the association between CD4 count and the risk of death in TB/HIV data. BMC Med Res Methodol 2022; 22:295. [PMID: 36401214 PMCID: PMC9675185 DOI: 10.1186/s12874-022-01775-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Accepted: 10/26/2022] [Indexed: 11/20/2022] Open
Abstract
Background The association structure linking the longitudinal and survival sub-models is of fundamental importance in the joint modeling framework and the choice of this structure should be made based on the clinical background of the study. However, this information may not always be accessible and rationale for selecting this association structure has received relatively little attention in the literature. To this end, we aim to explore four alternative functional forms of the association structure between the CD4 count and the risk of death and provide rationale for selecting the optimal association structure for our data. We also aim to compare the results obtained from the joint model to those obtained from the time-varying Cox model. Methods We used data from the Centre for the AIDS Programme of Research in South Africa (CAPRISA) AIDS Treatment programme, the Starting Antiretroviral Therapy at Three Points in Tuberculosis (SAPiT) study, an open-label, three armed randomised, controlled trial between June 2005 and July 2010 (N=642). In our analysis, we combined the early and late integrated arms and compared results to the sequential arm. We utilized the Deviance Information Criterion (DIC) to select the final model with the best structure, with smaller values indicating better model adjustments to the data. Results Patient characteristics were similar across the study arms. Combined integrated therapy arms had a reduction of 55% in mortality (HR:0.45, 95% CI:0.28-0.72) compared to the sequential therapy arm. The joint model with a cumulative effects functional form was chosen as the best association structure. In particular, our joint model found that the area under the longitudinal profile of CD4 count was strongly associated with a 21% reduction in mortality (HR:0.79, 95% CI:0.72-0.86). Where as results from the time-varying Cox model showed a 19% reduction in mortality (HR:0.81, 95% CI:0.77-0.84). Conclusions In this paper we have shown that the “current value” association structure is not always the best structure that expresses the correct relationship between the outcomes in all settings, which is why it is crucial to explore alternative clinically meaningful association structures that links the longitudinal and survival processes.
Collapse
|
9
|
Dagne GA. Joint mixture quantile regressions and time-to-event analysis. BRAZ J PROBAB STAT 2022. [DOI: 10.1214/22-bjps537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
10
|
Devaux A, Genuer R, Peres K, Proust-Lima C. Individual dynamic prediction of clinical endpoint from large dimensional longitudinal biomarker history: a landmark approach. BMC Med Res Methodol 2022; 22:188. [PMID: 35818025 PMCID: PMC9275051 DOI: 10.1186/s12874-022-01660-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 06/15/2022] [Indexed: 11/16/2022] Open
Abstract
Background The individual data collected throughout patient follow-up constitute crucial information for assessing the risk of a clinical event, and eventually for adapting a therapeutic strategy. Joint models and landmark models have been proposed to compute individual dynamic predictions from repeated measures to one or two markers. However, they hardly extend to the case where the patient history includes much more repeated markers. Our objective was thus to propose a solution for the dynamic prediction of a health event that may exploit repeated measures of a possibly large number of markers. Methods We combined a landmark approach extended to endogenous markers history with machine learning methods adapted to survival data. Each marker trajectory is modeled using the information collected up to the landmark time, and summary variables that best capture the individual trajectories are derived. These summaries and additional covariates are then included in different prediction methods adapted to survival data, namely regularized regressions and random survival forests, to predict the event from the landmark time. We also show how predictive tools can be combined into a superlearner. The performances are evaluated by cross-validation using estimators of Brier Score and the area under the Receiver Operating Characteristic curve adapted to censored data. Results We demonstrate in a simulation study the benefits of machine learning survival methods over standard survival models, especially in the case of numerous and/or nonlinear relationships between the predictors and the event. We then applied the methodology in two prediction contexts: a clinical context with the prediction of death in primary biliary cholangitis, and a public health context with age-specific prediction of death in the general elderly population. Conclusions Our methodology, implemented in R, enables the prediction of an event using the entire longitudinal patient history, even when the number of repeated markers is large. Although introduced with mixed models for the repeated markers and methods for a single right censored time-to-event, the technique can be used with any other appropriate modeling technique for the markers and can be easily extended to competing risks setting. Supplementary Information The online version contains supplementary material available at (10.1186/s12874-022-01660-3).
Collapse
Affiliation(s)
| | - Robin Genuer
- INSERM, BPH, U1219, Univ. Bordeaux, Bordeaux, France.,INRIA Bordeaux Sud-Ouest, Talence, France
| | - Karine Peres
- INSERM, BPH, U1219, Univ. Bordeaux, Bordeaux, France
| | | |
Collapse
|
11
|
Zhang Z, Charalambous C, Foster P. Joint modelling of longitudinal measurements and survival times via a multivariate copula approach. J Appl Stat 2022; 50:2739-2759. [PMID: 37720246 PMCID: PMC10503460 DOI: 10.1080/02664763.2022.2081965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 05/21/2022] [Indexed: 10/18/2022]
Abstract
Joint modelling of longitudinal and time-to-event data is usually described by a joint model which uses shared or correlated latent effects to capture associations between the two processes. Under this framework, the joint distribution of the two processes can be derived straightforwardly by assuming conditional independence given the random effects. Alternative approaches to induce interdependency into sub-models have also been considered in the literature and one such approach is using copulas to introduce non-linear correlation between the marginal distributions of the longitudinal and time-to-event processes. The multivariate Gaussian copula joint model has been proposed in the literature to fit joint data by applying a Monte Carlo expectation-maximisation algorithm. In this paper, we propose an exact likelihood estimation approach to replace the more computationally expensive Monte Carlo expectation-maximisation algorithm and we consider results based on using both the multivariate Gaussian and t copula functions. We also provide a straightforward way to compute dynamic predictions of survival probabilities, showing that our proposed model is comparable in prediction performance to the shared random effects joint model.
Collapse
Affiliation(s)
- Zili Zhang
- Department of Mathematics, University of Manchester, Manchester, UK
| | | | - Peter Foster
- Department of Mathematics, University of Manchester, Manchester, UK
| |
Collapse
|
12
|
Ye W, Ding X, Putnam N, Farej R, Singh R, Wang D, Kuo S, Kong SX, Elliott JC, Lott J, Herman WH. Development of clinical prediction models for renal and cardiovascular outcomes and mortality in patients with type 2 diabetes and chronic kidney disease using time-varying predictors. J Diabetes Complications 2022; 36:108180. [PMID: 35339377 DOI: 10.1016/j.jdiacomp.2022.108180] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 03/10/2022] [Accepted: 03/11/2022] [Indexed: 11/25/2022]
Abstract
AIMS To develop a set of prediction models for end-stage kidney disease (ESKD), cardiovascular outcomes, and mortality in patients with type 2 diabetes (T2D) and chronic kidney disease (CKD) using commonly measured clinical variables. METHODS We studied 1432 participants with T2D and CKD enrolled in the Chronic Renal Insufficiency Cohort, followed for a median period of 7 years. We used Cox proportional-hazards models to model the six outcomes (ESKD, stroke, myocardial infarction (MI), congestive heart failure (CHF), death before ESKD, and all-cause mortality). We internally evaluated these models using concordance and calibration measures. RESULTS The newly developed six prediction models included 15 predictors: age at diabetes diagnosis, sex, blood pressure, body mass index, hemoglobin A1c, high density lipoprotein cholesterol, urine protein-to-creatinine ratio, estimated glomerular filtration rate, smoking status, and history of stroke, MI, CHF, ESKD, and amputation. The resulting models demonstrated good/strong discrimination (cross-validation C-index range: 0.70 to 0.90) and calibration. CONCLUSIONS This study provided an internally validated and useful tool for predicting individual adverse outcomes and mortality in patients with T2D and CKD. These models may inform optimal use of targeted health interventions.
Collapse
Affiliation(s)
- Wen Ye
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, United States of America.
| | - Xuemei Ding
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, United States of America
| | - Nathaniel Putnam
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, United States of America
| | - Ryan Farej
- Bayer HealthCare Pharmaceuticals Inc. (US), Bayer Boulevard Whippany, NJ, United States of America
| | - Rakesh Singh
- Bayer HealthCare Pharmaceuticals Inc. (US), Bayer Boulevard Whippany, NJ, United States of America
| | - Di Wang
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, United States of America
| | - Shihchen Kuo
- Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, United States of America
| | - Sheldon X Kong
- Bayer HealthCare Pharmaceuticals Inc. (US), Bayer Boulevard Whippany, NJ, United States of America
| | - Jay C Elliott
- Bayer HealthCare Pharmaceuticals Inc. (US), Bayer Boulevard Whippany, NJ, United States of America
| | - Jason Lott
- Bayer HealthCare Pharmaceuticals Inc. (US), Bayer Boulevard Whippany, NJ, United States of America
| | - William H Herman
- Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, United States of America
| |
Collapse
|
13
|
Signorelli M, Spitali P, Szigyarto CAK, Tsonaka R. Penalized regression calibration: A method for the prediction of survival outcomes using complex longitudinal and high-dimensional data. Stat Med 2021; 40:6178-6196. [PMID: 34464990 PMCID: PMC9293191 DOI: 10.1002/sim.9178] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 08/10/2021] [Accepted: 08/10/2021] [Indexed: 11/18/2022]
Abstract
Longitudinal and high‐dimensional measurements have become increasingly common in biomedical research. However, methods to predict survival outcomes using covariates that are both longitudinal and high‐dimensional are currently missing. In this article, we propose penalized regression calibration (PRC), a method that can be employed to predict survival in such situations. PRC comprises three modeling steps: First, the trajectories described by the longitudinal predictors are flexibly modeled through the specification of multivariate mixed effects models. Second, subject‐specific summaries of the longitudinal trajectories are derived from the fitted mixed models. Third, the time to event outcome is predicted using the subject‐specific summaries as covariates in a penalized Cox model. To ensure a proper internal validation of the fitted PRC models, we furthermore develop a cluster bootstrap optimism correction procedure that allows to correct for the optimistic bias of apparent measures of predictiveness. PRC and the CBOCP are implemented in the R package pencal, available from CRAN. After studying the behavior of PRC via simulations, we conclude by illustrating an application of PRC to data from an observational study that involved patients affected by Duchenne muscular dystrophy, where the goal is predict time to loss of ambulation using longitudinal blood biomarkers.
Collapse
Affiliation(s)
- Mirko Signorelli
- Mathematical Institute, Leiden University, Leiden, The Netherlands
| | - Pietro Spitali
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | | | | | - Roula Tsonaka
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
14
|
Mehdizadeh P, Baghfalaki T, Esmailian M, Ganjali M. A two-stage approach for joint modeling of longitudinal measurements and competing risks data. J Biopharm Stat 2021; 31:448-468. [PMID: 33905295 DOI: 10.1080/10543406.2021.1918142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/30/2022]
Abstract
Joint modeling of longitudinal measurements and time-to-event data is used in many practical studies of medical sciences. Most of the time, particularly in clinical studies and health inquiry, there are more than one event and they compete for failing an individual. In this situation, assessing the competing risk failure time is important. In most cases, implementation of joint modeling involves complex calculations. Therefore, we propose a two-stage method for joint modeling of longitudinal measurements and competing risks (JMLC) data based on the full likelihood approach via the conditional EM (CEM) algorithm. In the first stage, a linear mixed effect model is used to estimate the parameters of the longitudinal sub-model. In the second stage, we consider a cause-specific sub-model to construct competing risks data and describe an approximation for the joint log-likelihood that uses the estimated parameters of the first stage. We express the results of a simulation study and perform this method on the "standard and new anti-epileptic drugs" trial to check the effect of drug assaying on the treatment effects of lamotrigine and carbamazepine through treatment failure.
Collapse
Affiliation(s)
- P Mehdizadeh
- Department of Statistics and Computer Sciences, Faculty of Sciences, University of Mohaghegh Ardabili, Ardabil, Iran
| | - Taban Baghfalaki
- Department of Statistics, Faculty of Mathematical Sciences, Tarbiat Modares University, Tehran, Iran
| | - M Esmailian
- Department of Statistics and Computer Sciences, Faculty of Sciences, University of Mohaghegh Ardabili, Ardabil, Iran
| | - M Ganjali
- Department of Statistics, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran, Iran
| |
Collapse
|
15
|
Two-Stage Joint Model for Multivariate Longitudinal and Multistate Processes, with Application to Renal Transplantation Data. JOURNAL OF PROBABILITY AND STATISTICS 2021. [DOI: 10.1155/2021/6641602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
In longitudinal studies, clinicians usually collect longitudinal biomarkers’ measurements over time until an event such as recovery, disease relapse, or death occurs. Joint modeling approaches are increasingly used to study the association between one longitudinal and one survival outcome. However, in practice, a patient may experience multiple disease progression events successively. So instead of modeling of a single event, progression of the disease as a multistate process should be modeled. On the other hand, in such studies, multivariate longitudinal outcomes may be collected and their association with the survival process is of interest. In the present study, we applied a joint model of various longitudinal biomarkers and transitions between different health statuses in patients who underwent renal transplantation. The full joint likelihood approaches are faced with the complexities in computation of the likelihood. So, here, we have proposed two-stage modeling of multivariate longitudinal outcomes and multistate conditions to avoid these complexities. The proposed model showed reliable results compared to the joint model in case of joint modeling of univariate longitudinal biomarker and the multistate process.
Collapse
|
16
|
Methods to Account for Uncertainty in Latent Class Assignments When Using Latent Classes as Predictors in Regression Models, with Application to Acculturation Strategy Measures. Epidemiology 2021; 31:194-204. [PMID: 31809338 DOI: 10.1097/ede.0000000000001139] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Latent class models have become a popular means of summarizing survey questionnaires and other large sets of categorical variables. Often these classes are of primary interest to better understand complex patterns in data. Increasingly, these latent classes are reified into predictors of other outcomes of interests, treating the most likely class as the true class to which an individual belongs even though there is uncertainty in class membership. This uncertainty can be viewed as a form of measurement error in predictors, leading to bias in the estimates of the regression parameters associated with the latent classes. Despite this fact, there is very limited literature treating latent class predictors as measurement error models. Most applications ignore this issue and fit a two-stage model that treats the modal class prediction as truth. Here, we develop two approaches-one likelihood-based, the other Bayesian-to implement a joint model for latent class analysis and outcome prediction. We apply these methods to an analysis of how acculturation behaviors predict depression in South Asian immigrants to the United States. A simulation study gives guidance for when a two-stage model can be safely implemented and when the joint model may be required.
Collapse
|
17
|
Ngwa JS, Cabral HJ, Cheng DM, Gagnon DR, LaValley MP, Cupples LA. Revisiting methods for modeling longitudinal and survival data: Framingham Heart Study. BMC Med Res Methodol 2021; 21:29. [PMID: 33568059 PMCID: PMC7876802 DOI: 10.1186/s12874-021-01207-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 01/13/2021] [Indexed: 11/27/2022] Open
Abstract
Background Statistical methods for modeling longitudinal and time-to-event data has received much attention in medical research and is becoming increasingly useful. In clinical studies, such as cancer and AIDS, longitudinal biomarkers are used to monitor disease progression and to predict survival. These longitudinal measures are often missing at failure times and may be prone to measurement errors. More importantly, time-dependent survival models that include the raw longitudinal measurements may lead to biased results. In previous studies these two types of data are frequently analyzed separately where a mixed effects model is used for the longitudinal data and a survival model is applied to the event outcome. Methods In this paper we compare joint maximum likelihood methods, a two-step approach and a time dependent covariate method that link longitudinal data to survival data with emphasis on using longitudinal measures to predict survival. We apply a Bayesian semi-parametric joint method and maximum likelihood joint method that maximizes the joint likelihood of the time-to-event and longitudinal measures. We also implement the Two-Step approach, which estimates random effects separately, and a classic Time Dependent Covariate Model. We use simulation studies to assess bias, accuracy, and coverage probabilities for the estimates of the link parameter that connects the longitudinal measures to survival times. Results Simulation results demonstrate that the Two-Step approach performed best at estimating the link parameter when variability in the longitudinal measure is low but is somewhat biased downwards when the variability is high. Bayesian semi-parametric and maximum likelihood joint methods yield higher link parameter estimates with low and high variability in the longitudinal measure. The Time Dependent Covariate method resulted in consistent underestimation of the link parameter. We illustrate these methods using data from the Framingham Heart Study in which lipid measurements and Myocardial Infarction data were collected over a period of 26 years. Conclusions Traditional methods for modeling longitudinal and survival data, such as the time dependent covariate method, that use the observed longitudinal data, tend to provide downwardly biased estimates. The two-step approach and joint models provide better estimates, although a comparison of these methods may depend on the underlying residual variance. Supplementary Information The online version contains supplementary material available at 10.1186/s12874-021-01207-y.
Collapse
Affiliation(s)
- Julius S Ngwa
- Department of Biostatistics, School of Public Health, Boston University, 801 Massachusetts Ave, CT 3rd Floor, Boston, MA, 02118, USA. .,Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 North Wolfe St E3009, Baltimore, MD, 21205, USA.
| | - Howard J Cabral
- Department of Biostatistics, School of Public Health, Boston University, 801 Massachusetts Ave, CT 3rd Floor, Boston, MA, 02118, USA
| | - Debbie M Cheng
- Department of Biostatistics, School of Public Health, Boston University, 801 Massachusetts Ave, CT 3rd Floor, Boston, MA, 02118, USA
| | - David R Gagnon
- Department of Biostatistics, School of Public Health, Boston University, 801 Massachusetts Ave, CT 3rd Floor, Boston, MA, 02118, USA
| | - Michael P LaValley
- Department of Biostatistics, School of Public Health, Boston University, 801 Massachusetts Ave, CT 3rd Floor, Boston, MA, 02118, USA
| | - L Adrienne Cupples
- Department of Biostatistics, School of Public Health, Boston University, 801 Massachusetts Ave, CT 3rd Floor, Boston, MA, 02118, USA. .,National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, 01702, USA.
| |
Collapse
|
18
|
Azarbar A, Wang Y, Nadarajah S. Simultaneous Bayesian modeling of longitudinal and survival data in breast cancer patients. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2019.1635701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Affiliation(s)
- Ali Azarbar
- Department of Statistics, Amirkabir University of Technology, Tehran, Iran
| | - Yu Wang
- School of Mathematics, University of Manchester, Manchester, UK
| | | |
Collapse
|
19
|
Gupta R, Khoury JC, Altaye M, Jandarov R, Szczesniak RD. Flexible multivariate joint model of longitudinal intensity and binary process for medical monitoring of frequently collected data. Stat Med 2021; 40:1845-1858. [PMID: 33426642 DOI: 10.1002/sim.8875] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 12/22/2020] [Accepted: 12/23/2020] [Indexed: 11/08/2022]
Abstract
A frequent problem in longitudinal studies is that data may be assessed at subject-selected, irregularly spaced time-points, resulting in highly unbalanced outcome data, inducing bias, especially if availability of data is directly related to outcome. Our aim was to develop a multivariate joint model in a mixed outcomes framework to minimize irregular sampling bias. We demonstrate using blood glucose monitoring throughout pregnancy and risk of preterm birth among women with type 1 diabetes mellitus. Blood glucose measurements were unequally spaced and intensity of sampling varied between and within individuals over time. Multivariate linear mixed effects submodel for the longitudinal outcome (blood glucose), Poisson model for the intensity of glucose sampling, and logistic regression model for binary process (preterm birth) were specified. Association between models is captured through shared random effects. Markov chain Monte Carlo methods were used to fit the model. The multivariate joint model provided better prediction, compared with a joint model with a multivariate linear mixed effects submodel (ignoring intensity of glucose sampling) and a two-stage model. Most association parameters were significant in the preterm birth outcome model, signifying improvement of predictive ability of the binary endpoint by sharing random effects between glucose monitoring and preterm birth. A simulation study is presented to illustrate the effectiveness of the multivariate joint modeling approach.
Collapse
Affiliation(s)
- Resmi Gupta
- Biostatistics and Epidemiology/Research Design Component, Division of Clinical and Translational Sciences, Department of Internal Medicine, University of Texas McGovern Medical School, Houston, Texas, USA
| | - Jane C Khoury
- Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital, Cincinnati, Ohio, USA
| | - Mekibib Altaye
- Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital, Cincinnati, Ohio, USA
| | - Roman Jandarov
- Department of Biostatistics, University of Cincinnati, Cincinnati, Ohio, USA
| | - Rhonda D Szczesniak
- Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital, Cincinnati, Ohio, USA
| |
Collapse
|
20
|
Leiva-Yamaguchi V, Alvares D. A Two-Stage Approach for Bayesian Joint Models of Longitudinal and Survival Data: Correcting Bias with Informative Prior. ENTROPY 2020; 23:e23010050. [PMID: 33396212 PMCID: PMC7824570 DOI: 10.3390/e23010050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 12/21/2020] [Accepted: 12/27/2020] [Indexed: 11/28/2022]
Abstract
Joint models of longitudinal and survival outcomes have gained much popularity in recent years, both in applications and in methodological development. This type of modelling is usually characterised by two submodels, one longitudinal (e.g., mixed-effects model) and one survival (e.g., Cox model), which are connected by some common term. Naturally, sharing information makes the inferential process highly time-consuming. In particular, the Bayesian framework requires even more time for Markov chains to reach stationarity. Hence, in order to reduce the modelling complexity while maintaining the accuracy of the estimates, we propose a two-stage strategy that first fits the longitudinal submodel and then plug the shared information into the survival submodel. Unlike a standard two-stage approach, we apply a correction by incorporating an individual and multiplicative fixed-effect with informative prior into the survival submodel. Based on simulation studies and sensitivity analyses, we empirically compare our proposal with joint specification and standard two-stage approaches. The results show that our methodology is very promising, since it reduces the estimation bias compared to the other two-stage method and requires less processing time than the joint specification approach.
Collapse
|
21
|
Nevo D, Hamada T, Ogino S, Wang M. A novel calibration framework for survival analysis when a binary covariate is measured at sparse time points. Biostatistics 2020; 21:e148-e163. [PMID: 30380012 DOI: 10.1093/biostatistics/kxy063] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2018] [Revised: 08/04/2018] [Accepted: 10/02/2018] [Indexed: 01/29/2023] Open
Abstract
The goals in clinical and cohort studies often include evaluation of the association of a time-dependent binary treatment or exposure with a survival outcome. Recently, several impactful studies targeted the association between initiation of aspirin and survival following colorectal cancer (CRC) diagnosis. The value of this exposure is zero at baseline and may change its value to one at some time point. Estimating this association is complicated by having only intermittent measurements on aspirin-taking. Commonly used methods can lead to substantial bias. We present a class of calibration models for the distribution of the time of status change of the binary covariate. Estimates obtained from these models are then incorporated into the proportional hazard partial likelihood in a natural way. We develop non-parametric, semiparametric, and parametric calibration models, and derive asymptotic theory for the methods that we implement in the aspirin and CRC study. We further develop a risk-set calibration approach that is more useful in settings in which the association between the binary covariate and survival is strong.
Collapse
Affiliation(s)
- Daniel Nevo
- Departments of Biostatistics and Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Tsuyoshi Hamada
- Department of Oncologic Pathology, Dana-Farber Cancer Institute and Harvard Medical School Boston, MA, USA
| | - Shuji Ogino
- Department of Oncologic Pathology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA.,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.,Program in MPE Molecular Pathological Epidemiology, Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Molin Wang
- Departments of Biostatistics and Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.,Channing Division of Network & Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|
22
|
Zheng Y, Zhao X, Zhang X. A novel approach to estimate the Cox model with temporal covariates and application to medical cost data. COMMUN STAT-THEOR M 2020. [DOI: 10.1080/03610926.2019.1602651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Yanqiao Zheng
- School of Finance, Zhejiang University of Finance and Economics, Hangzhou, China
| | - Xiaobing Zhao
- School of Data Science, Zhejiang University of Finance and Economics Hangzhou, China
| | - Xiaoqi Zhang
- School of Finance, Zhejiang University of Finance and Economics, Hangzhou, China
| |
Collapse
|
23
|
Zhang F, Chen MH, Cong XJ, Chen Q. Assessing Importance of Biomarkers: a Bayesian Joint Modeling Approach of Longitudinal and Survival Data with Semicompeting Risks. STAT MODEL 2020; 21:30-55. [PMID: 34326706 DOI: 10.1177/1471082x20933363] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Longitudinal biomarkers such as patient-reported outcomes (PROs) and quality of life (QOL) are routinely collected in cancer clinical trials or other studies. Joint modeling of PRO/QOL and survival data can provide a comparative assessment of patient-reported changes in specific symptoms or global measures that correspond to changes in survival. Motivated by a head and neck cancer clinical trial, we develop a class of trajectory-based models for longitudinal and survival data with disease progression. Specifically, we propose a class of mixed effects regression models for longitudinal measures, a cure rate model for the disease progression time (T P ), and a Cox proportional hazards model with time-varying covariates for the overall survival time (T D ) to account for T P and treatment switching. Under the semi-competing risks framework, the disease progression is the nonterminal event, the occurrence of which is subject to a terminal event of death. The properties of the proposed models are examined in detail. Within the Bayesian paradigm, we derive the decompositions of the deviance information criterion (DIC) and the logarithm of the pseudo marginal likelihood (LPML) to assess the fit of the longitudinal component of the model and the fit of each survival component, separately. We further develop ΔDIC as well as ΔLPML to determine the importance and contribution of the longitudinal data to the model fit of the T P and T D data.
Collapse
Affiliation(s)
| | - Ming-Hui Chen
- Department of Statistics, University of Connecticut, Storrs, CT, USA
| | | | - Qingxia Chen
- Department of Biostatistics, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
24
|
Zeki Al Hazzouri A, Vittinghoff E, Zhang Y, Pletcher MJ, Moran AE, Bibbins-Domingo K, Golden SH, Yaffe K. Use of a pooled cohort to impute cardiovascular disease risk factors across the adult life course. Int J Epidemiol 2020; 48:1004-1013. [PMID: 30535320 DOI: 10.1093/ije/dyy264] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/10/2018] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND In designing prevention strategies, it may be useful to understand how early and midlife cardiovascular disease risk factor (CVDRF) exposures affect outcomes that primarily occur in mid to late life. Few single US cohorts have followed participants from early adulthood to late life. METHODS We pooled four prospective cohorts that represent segments of the adult life course, and studied 15 001 White and Black adults aged 18 to 95 years at enrollment. We imputed early and midlife exposure to body mass index (BMI), glucose, lipids and blood pressure (BP). CVDRF trajectories were estimated using linear mixed models. Using the best linear unbiased predictions, we obtained person-specific estimates of CVDRF trajectories beginning at age 20 until each participant's end of follow-up. We then calculated for each CVDRF, summary measures of early and midlife exposure as time-weighted averages (TWAs). RESULTS In the pooled cohort, 33.7% were Black and 54.8% were female. CVDRF summary measures worsened in midlife compared with early life and varied by sex and race. In particular, systolic and diastolic BP were consistently higher over the adult life course among men, and BMI was higher among Blacks, particularly Black women. Simulation studies suggested acceptable imputation accuracy, especially for the younger cohorts. Correlations of true and imputed CVDRF summary measures ranged from 0.53 to 0.99, and agreement ranged from 67% to 99%. CONCLUSIONS These results suggest that imputed CVDRFs may be accurate enough to be useful in assessing the effects of early and midlife exposures on later life outcomes.
Collapse
Affiliation(s)
| | - Eric Vittinghoff
- Department of Epidemiology & Biostatistics, University of California San Francisco, San Francisco, CA, USA
| | - Yiyi Zhang
- Division of General Medicine, Columbia University Medical Center, New York, NY, USA
| | - Mark J Pletcher
- Department of Epidemiology & Biostatistics, University of California San Francisco, San Francisco, CA, USA
| | - Andrew E Moran
- Division of General Medicine, Columbia University Medical Center, New York, NY, USA
| | - Kirsten Bibbins-Domingo
- Department of Epidemiology & Biostatistics, University of California San Francisco, San Francisco, CA, USA
| | - Sherita H Golden
- Department of Medicine, Johns Hopkins University School of Medicine, Department of Epidemiology, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD, USA
| | - Kristine Yaffe
- Department of Epidemiology & Biostatistics, University of California San Francisco, San Francisco, CA, USA.,Department of Psychiatry, University of California San Francisco, San Francisco Veterans Affairs Medical Center, San Francisco, CA, USA.,Department of Neurology, University of California San Francisco, San Francisco Veterans Affairs Medical Center, San Francisco, CA, USA
| |
Collapse
|
25
|
Baghfalaki T, Kalantari S, Ganjali M, Hadaegh F, Pahlavanzadeh B. Bayesian joint modeling of ordinal longitudinal measurements and competing risks survival data for analysing Tehran Lipid and Glucose Study. J Biopharm Stat 2020; 30:689-703. [DOI: 10.1080/10543406.2020.1730876] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
- Taban Baghfalaki
- Department of Statistics, Faculty of Mathematical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Shiva Kalantari
- Department of Statistics, Faculty of Mathematical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Mojtaba Ganjali
- Department of Statistics, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran, Iran
| | - Farzad Hadaegh
- Prevention of Metabolic Disorders Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Bagher Pahlavanzadeh
- Department of Community Medicine and Health, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
26
|
Franco Soto DC, Pedroso de Lima AC, Da Motta Singer J. A Birnbaum-Saunders Model for Joint Survival and Longitudinal Analysis of Congestive Heart Failure Data. REVISTA COLOMBIANA DE ESTADÍSTICA 2020. [DOI: 10.15446/rce.v43n1.77851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
We consider a parametric joint modelling of longitudinal measurements and survival times, motivated by a study conducted at the Heart Institute (Incor), São Paulo, Brazil, with the objective of evaluating the impact of B-type Natriuretic Peptide (BNP) collected at different instants on the survival of patients with Congestive Heart Failure (CHF). We employ a linear mixed model for the longitudinal response and a Birnbaum-Saunders model for the survival times, allowing the inclusion of subjects without longitudinal observations. We derive maximum likelihood estimators of the joint model parameters and conduct a simulation study to compare the true survival probabilities with dynamic predictions obtained from the fit of the proposed joint model and to evaluate the performance of the method for estimating the model parameters.The proposed joint model is applied to the cohort of 1609 patients with CHF, of which 1080 have no BNP measurements. The parameter estimates and their standard errors obtained via: i) the traditional approach, where only individuals with at least one measurement of the longitudinal response are included and ii) the proposed approach, which includes survival information from all individuals, are compared with those obtained via marginal (longitudinal and survival) models.
Collapse
|
27
|
Gupta R, Khoury JC, Altaye M, Jandarov R, Szczesniak RD. Assessing the Relationship between Gestational Glycemic Control and Risk of Preterm Birth in Women with Type 1 Diabetes: A Joint Modeling Approach. J Diabetes Res 2020; 2020:3074532. [PMID: 32685553 PMCID: PMC7333058 DOI: 10.1155/2020/3074532] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Revised: 05/19/2020] [Accepted: 06/10/2020] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Characterizing maternal glucose sampling over the course of the entire pregnancy is an important step toward improvement in prediction of adverse birth outcome, such as preterm birth, for women with type 1 diabetes mellitus (T1DM). OBJECTIVES To characterize the relationship between the gestational glycemic profile and risk of preterm birth using a joint modeling approach. METHODS A joint model was developed to simultaneously characterize the relationship between a longitudinal outcome (daily blood glucose sampling) and an event process (preterm birth). A linear mixed effects model using natural cubic splines was fitted to predict the longitudinal submodel. Covariates included mother's age at last menstrual period, age at diabetes onset, body mass index, hypertension, retinopathy, and nephropathy. Various association structures (value, value plus slope, and area under the curve) were examined before selecting the final joint model. We compared the joint modeling approach to the time-dependent Cox model (TDCM). RESULTS A total of 16,480 glucose readings over gestation (range: 50-260 days) with 32 women (28%) having preterm birth was included in the study. Mother's age at last menstrual period and age at diabetes onset were statistically significant (beta = 1.29, 95% CI 1.10, 1.72; beta = 0.84, 95% CI 0.62, 0.98) for the longitudinal submodel, reflecting that older women tended to have higher mean blood glucose and those with later diabetes onset tended to have a lower mean blood glucose level. The presence of nephropathy was statistically significant in the event submodel (beta = 2.29, 95% CI 1.05, 4.48). Cumulative association parameterization provided the best joint model fit. The joint model provided better fit compared to the time-dependent Cox model (DIC (JM) = 19,895; DIC (TDCM) = 19,932). CONCLUSION The joint model approach was able to simultaneously characterize the glycemic profile and assess the risk of preterm birth and provided additional insights and a better model fit compared to the time-dependent Cox model.
Collapse
Affiliation(s)
- Resmi Gupta
- Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital, Cincinnati, Ohio, USA
| | - Jane C. Khoury
- Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital, Cincinnati, Ohio, USA
| | - Mekibib Altaye
- Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital, Cincinnati, Ohio, USA
| | - Roman Jandarov
- Department of Biostatistics, University of Cincinnati, Cincinnati, Ohio, USA
| | - Rhonda D. Szczesniak
- Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital, Cincinnati, Ohio, USA
| |
Collapse
|
28
|
Murad H, Dankner R, Berlin A, Olmer L, Freedman LS. Imputing missing time-dependent covariate values for the discrete time Cox model. Stat Methods Med Res 2019; 29:2074-2086. [PMID: 31680633 DOI: 10.1177/0962280219881168] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
We describe a procedure for imputing missing values of time-dependent covariates in a discrete time Cox model using the chained equations method. The procedure multiply imputes the missing values for each time-period in a time-sequential manner, using covariates from the current and previous time-periods as well as the survival outcome. The form of the outcome variable used in the imputation model depends on the functional form of the time-dependent covariate(s) and differs from the case of Cox regression with only baseline covariates. This time-sequential approach provides an approximation to a fully conditional approach. We illustrate the procedure with data on diabetics, evaluating the association of their glucose control with the risk of selected cancers. Using simulations we show that the suggested estimator performed well (in terms of bias and coverage) for completely missing at random, missing at random and moderate non-missing-at-random patterns. However, for very strong non-missing-at-random patterns, the estimator was seriously biased and the coverage was too low. The procedure can be implemented using multiple imputation with the Fully conditional Specification (FCS) method (MI procedure in SAS with FCS statement or similar packages in other software, e.g. MICE in R). For use with event times on a continuous scale, the events would need to be grouped into time-intervals.
Collapse
Affiliation(s)
- Havi Murad
- Biostatistics and Biomathematics Unit, Gertner Institute, Sheba Medical Center, Tel-Hashomer, Israel
| | - Rachel Dankner
- Cardiovascular Epidemiology Unit, Gertner Institute, Sheba Medical Center, Tel-Hashomer, Israel.,Department of Epidemiology and Preventive Medicine, Sackler Faculty of Medicine, School of Public Health, Tel-Aviv University, Tel-Aviv, Israel
| | - Alla Berlin
- Cardiovascular Epidemiology Unit, Gertner Institute, Sheba Medical Center, Tel-Hashomer, Israel
| | - Liraz Olmer
- Biostatistics and Biomathematics Unit, Gertner Institute, Sheba Medical Center, Tel-Hashomer, Israel
| | - Laurence S Freedman
- Biostatistics and Biomathematics Unit, Gertner Institute, Sheba Medical Center, Tel-Hashomer, Israel
| |
Collapse
|
29
|
Moreno-Betancur M, Carlin JB, Brilleman SL, Tanamas SK, Peeters A, Wolfe R. Survival analysis with time-dependent covariates subject to missing data or measurement error: Multiple Imputation for Joint Modeling (MIJM). Biostatistics 2019; 19:479-496. [PMID: 29040396 DOI: 10.1093/biostatistics/kxx046] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Accepted: 08/22/2017] [Indexed: 11/14/2022] Open
Abstract
Modern epidemiological studies collect data on time-varying individual-specific characteristics, such as body mass index and blood pressure. Incorporation of such time-dependent covariates in time-to-event models is of great interest, but raises some challenges. Of specific concern are measurement error, and the non-synchronous updating of covariates across individuals, due for example to missing data. It is well known that in the presence of either of these issues the last observation carried forward (LOCF) approach traditionally used leads to bias. Joint models of longitudinal and time-to-event outcomes, developed recently, address these complexities by specifying a model for the joint distribution of all processes and are commonly fitted by maximum likelihood or Bayesian approaches. However, the adequate specification of the full joint distribution can be a challenging modeling task, especially with multiple longitudinal markers. In fact, most available software packages are unable to handle more than one marker and offer a restricted choice of survival models. We propose a two-stage approach, Multiple Imputation for Joint Modeling (MIJM), to incorporate multiple time-dependent continuous covariates in the semi-parametric Cox and additive hazard models. Assuming a primary focus on the time-to-event model, the MIJM approach handles the joint distribution of the markers using multiple imputation by chained equations, a computationally convenient procedure that is widely available in mainstream statistical software. We developed an R package "survtd" that allows MIJM and other approaches in this manuscript to be applied easily, with just one call to its main function. A simulation study showed that MIJM performs well across a wide range of scenarios in terms of bias and coverage probability, particularly compared with LOCF, simpler two-stage approaches, and a Bayesian joint model. The Framingham Heart Study is used to illustrate the approach.
Collapse
Affiliation(s)
- Margarita Moreno-Betancur
- Department of Epidemiology and Preventive Medicine, Monash University, 99 Commercial Rd, Melbourne, VIC, Australia.,Clinical Epidemiology and Biostatistics Unit, Murdoch Childrens Research Institute, 50 Flemington Rd, Parkville, VIC, Australia
| | - John B Carlin
- Clinical Epidemiology and Biostatistics Unit, Murdoch Childrens Research Institute, Parkville, Australia.,Melbourne School of Population and Global Health, University of Melbourne, Carlton, Australia
| | - Samuel L Brilleman
- Department of Epidemiology and Preventive Medicine, Monash University, Melbourne, Australia
| | | | - Anna Peeters
- School of Health and Social Development, Deakin University, Burwood, Australia
| | - Rory Wolfe
- Department of Epidemiology and Preventive Medicine, Monash University, Melbourne, Australia
| |
Collapse
|
30
|
Han Y, Liu D. Accounting for random observation time in risk prediction with longitudinal markers: An imputation approach. Stat Methods Med Res 2019; 29:396-412. [PMID: 30854937 DOI: 10.1177/0962280219833089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Longitudinally measured biomarkers are useful to predict the risk of clinical endpoints, since subject-specific marker trajectory contains additional information on pathology and critical windows. The work is motivated by the Scandinavian Fetal Growth Study, aiming at predicting pregnancy outcomes with repeated ultrasound measurements during pregnancy. While the observation time of markers often varies across individuals, it is not well understood how the variations affect risk prediction. Existing methods of longitudinal risk prediction, such as shared random effects model and pattern mixture model, construct a prediction implicitly as a function of the biomarkers and their observation time. Methods that ignore the longitudinal structure, such as sufficient dimension reduction and logistic regression, have better interpretability regarding how a biomarker measured at specific time window contributes to the disease risk, but often have reduced accuracy because of ignoring the observation time information. We propose a novel imputation approach to handle the random observation time, while preserving the direct interpretation. Through extensive simulation studies and analyses of the Scandinavian Fetal Growth Study data, we systematically compared the discrimination and calibration performance of different risk prediction methods, and found that the imputation method has comparable performance to longitudinal methods with an advantage of better interpretability.
Collapse
Affiliation(s)
- Yongli Han
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
| | - Danping Liu
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
| |
Collapse
|
31
|
Lee J, Thall PF, Lin SH. Bayesian Semiparametric Joint Regression Analysis of Recurrent Adverse Events and Survival in Esophageal Cancer Patients. Ann Appl Stat 2019; 13:221-247. [PMID: 31681453 PMCID: PMC6824476 DOI: 10.1214/18-aoas1182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2023]
Abstract
We propose a Bayesian semiparametric joint regression model for a recurrent event process and survival time. Assuming independent latent subject frailties, we define marginal models for the recurrent event process intensity and survival distribution as functions of the subject's frailty and baseline covariates. A robust Bayesian model, called Joint-DP, is obtained by assuming a Dirichlet process for the frailty distribution. We present a simulation study that compares posterior estimates under the Joint-DP model to a Bayesian joint model with lognormal frailties, a frequentist joint model, and marginal models for either the recurrent event process or survival time. The simulations show that the Joint-DP model does a good job of correcting for treatment assignment bias, and has favorable estimation reliability and accuracy compared with the alternative models. The Joint-DP model is applied to analyze an observational dataset from esophageal cancer patients treated with chemo-radiation, including the times of recurrent effusions of fluid to the heart or lungs, survival time, prognostic covariates, and radiation therapy modality.
Collapse
Affiliation(s)
- Juhee Lee
- Department of Applied Mathematics and Statistics, University California Santa Cruz, Santa Cruz, CA
| | | | - Steven H. Lin
- Department of Radiation Oncology, M.D. Anderson, Huston, TX
| |
Collapse
|
32
|
Alonso EM, Ye W, Hawthorne K, Venkat V, Loomes KM, Mack CL, Hertel PM, Karpen SJ, Kerkar N, Molleston JP, Murray KF, Romero R, Rosenthal P, Schwarz KB, Shneider BL, Suchy FJ, Turmelle YP, Wang KS, Sherker AH, Sokol RJ, Bezerra JA, Magee JC. Impact of Steroid Therapy on Early Growth in Infants with Biliary Atresia: The Multicenter Steroids in Biliary Atresia Randomized Trial. J Pediatr 2018; 202:179-185.e4. [PMID: 30244988 PMCID: PMC6365098 DOI: 10.1016/j.jpeds.2018.07.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Revised: 06/06/2018] [Accepted: 07/02/2018] [Indexed: 01/08/2023]
Abstract
OBJECTIVE To investigate the impact of corticosteroid therapy on the growth of participants in the Steroids in Biliary Atresia Randomized Trial (START) conducted through the Childhood Liver Disease Research Network. The primary analysis in START indicated that steroids did not have a beneficial effect on drainage in a cohort of infants with biliary atresia. We hypothesized that steroids would have a detrimental effect on growth in these infants. STUDY DESIGN A total of 140 infants were enrolled in START, with 70 randomized to each treatment arm: steroid and placebo. Length, weight, and head circumference were obtained at baseline and follow-up visits to 24 months of age. RESULTS Patients treated with steroids had significantly lower length and head circumference z scores during the first 3 months post-hepatoportoenterostomy (HPE), and significantly lower weight until 12 months. Growth trajectories in the steroid and placebo arms differed significantly for length (P < .0001), weight (P = .009), and head circumference (P < .0001) with the largest impact noted for those with successful HPE. Growth trajectory for head circumference was significantly lower in patients treated with steroids irrespective of HPE status, but recovered during the second 6 months of life. CONCLUSIONS Steroid therapy following HPE in patients with biliary atresia is associated with impaired length, weight, and head circumference growth trajectories for at least 6 months post-HPE, especially impacting infants with successful bile drainage. TRIAL REGISTRATION ClinicalTrials.gov: NCT00294684.
Collapse
Affiliation(s)
- Estella M Alonso
- Division of Gastroenterology, Hepatology, and Nutrition, Ann and Robert H. Lurie Children's Hospital of Chicago, Northwestern University Feinberg School of Medicine, Chicago, IL.
| | - Wen Ye
- Department of Biostatistics, University of Michigan, Ann Arbor, MI
| | | | - Veena Venkat
- Children's Hospital of Pittsburgh, Pittsburgh, PA
| | - Kathleen M Loomes
- Division of Gastroenterology, Hepatology, and Nutrition, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Cara L Mack
- Section of Pediatric Gastroenterology, Hepatology, and Nutrition, Department of Pediatrics, University of Colorado School of Medicine, Children's Hospital Colorado, Aurora, CO
| | - Paula M Hertel
- Pediatric Gastroenterology, Hepatology and Nutrition, Baylor College of Medicine, Houston, TX
| | - Saul J Karpen
- Division of Pediatric Gastroenterology, Hepatology, and Nutrition, Department of Pediatrics, Emory University School of Medicine/Children's Healthcare of Atlanta, Atlanta, GA
| | - Nanda Kerkar
- Division of Pediatric Gastroenterology, Hepatology and Nutrition, Children's Hospital Los Angeles, University of Southern California, Los Angeles, CA; Division of Pediatric Gastroenterology, The Mount Sinai School of Medicine, New York, NY
| | - Jean P Molleston
- Section of Pediatric Gastroenterology, Hepatology, and Nutrition, Indiana University School of Medicine, Rylie Hospital for Children, Indianapolis, IN
| | - Karen F Murray
- Division of Gastroenterology and Hepatology, Department of Pediatrics, University of Washington and Seattle Children's, Seattle, WA
| | - Rene Romero
- Pediatrics, Emory University School of Medicine/Children's Healthcare of Atlanta, Atlanta, GA
| | - Philip Rosenthal
- Division of Gastroenterology, Hepatology, and Nutrition, Department of Pediatrics, University of California, San Francisco Benioff Children's Hospital, San Francisco, CA
| | | | - Benjamin L Shneider
- Division of Pediatric Gastroenterology, Hepatology, and Nutrition, Department of Pediatrics, Baylor College of Medicine, Houston, TX
| | - Frederick J Suchy
- Children's Hospital Research Institute, Children's Hospital Colorado, Aurora, CO
| | | | - Kasper S Wang
- Division of Pediatric Gastroenterology, Hepatology and Nutrition, Children's Hospital Los Angeles, University of Southern California, Los Angeles, CA
| | - Averell H Sherker
- Liver Diseases Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD
| | - Ronald J Sokol
- Section of Pediatric Gastroenterology, Hepatology, and Nutrition, Department of Pediatrics, University of Colorado School of Medicine, Children's Hospital Colorado, Aurora, CO
| | - Jorge A Bezerra
- Division of Pediatric Gastroenterology, Hepatology, and Nutrition, Cincinnati Children's Hospital Medical Center, Cincinnati, OH
| | - John C Magee
- Department of Surgery, University of Michigan Medical School, Ann Arbor, MI
| |
Collapse
|
33
|
Huong PTT, Nur D, Pham H, Branford A. A modified two-stage approach for joint modelling of longitudinal and time-to-event data. J STAT COMPUT SIM 2018. [DOI: 10.1080/00949655.2018.1518449] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Affiliation(s)
- Pham Thi Thu Huong
- School of Mathematics, An Giang University, Long Xuyen, An Giang, Vietnam
| | - Darfiana Nur
- School of Computer Science, Engineering and Mathematics, Flinders University, Adelaide, South Australia, Australia
| | - Hoa Pham
- School of Mathematics, An Giang University, Long Xuyen, An Giang, Vietnam
| | - Alan Branford
- School of Computer Science, Engineering and Mathematics, Flinders University, Adelaide, South Australia, Australia
| |
Collapse
|
34
|
Elmi AF, Grantz KL, Albert PS. An approximate joint model for multiple paired longitudinal outcomes and time-to-event data. Biometrics 2018; 74:1112-1119. [PMID: 29492955 PMCID: PMC7592178 DOI: 10.1111/biom.12862] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Revised: 12/01/2018] [Accepted: 01/01/2018] [Indexed: 11/27/2022]
Abstract
Joint modeling of multivariate paired longitudinal data and time-to-event data presents computational challenges that supersede full likelihood estimation due to the large dimensional random effects vector needed to capture correlation due to clustering with respect to pairs, subjects, and outcomes. We propose an alternative, computationally simpler approach to estimation of complex shared parameter models where missing data is imputed based on the Posterior Predictive Distribution from a Conditional Linear Model (CLM) approximation. Existing methods for complete data are then implemented to obtain estimates of the event time model parameters. Our method is applied to examine the effects of discordant growth in anthropometric measures of longitudinal fetal growth in twin fetuses and the timing of birth. Simulation results are presented to show that our method performs relatively well with moderate measurement errors under certain CLM approximations.
Collapse
Affiliation(s)
- Angelo F. Elmi
- Department of Epidemiology and Biostatistics, The Milken Institute School of Public Health at The George Washington University, Washington, D.C. 20052, U.S.A
| | - Katherine L. Grantz
- Epidemiology Branch, Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, Maryland 20817, U.S.A
| | - Paul S. Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland 20850, U.S.A
| |
Collapse
|
35
|
Dagne GA. Heterogeneous growth bent-cable models for time-to-event and longitudinal data: application to AIDS studies. J Biopharm Stat 2018; 28:1216-1230. [PMID: 29953318 DOI: 10.1080/10543406.2018.1489407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
The major limitations of growth curve mixture models for HIV/AIDS data are the usual assumptions of normality and monophasic curves within latent classes. This article addresses these limitations by using non-normal skewed distributions and multiphasic patterns for outcomes of prospective studies. For such outcomes, new skew-t (ST) distributions are proposed for modeling heterogeneous growth trajectories, which exhibit not abrupt but gradual multiphasic changes from a declining trend to an increasing trend over time. We assess these clinically important features of longitudinal HIV/AIDS data using the bent-cable framework within a context of a joint modeling of time-to-event process and response process. A real dataset from an AIDS clinical study is used to illustrate the proposed methods.
Collapse
Affiliation(s)
- Getachew A Dagne
- a Department of Epidemiology & Biostatistics, College of Public Health, MDC 56 , University of South Florida , Tampa , FL , USA
| |
Collapse
|
36
|
Guler I, Faes C, Cadarso-Suárez C, Teixeira L, Rodrigues A, Mendonça D. Two-stage model for multivariate longitudinal and survival data with application to nephrology research. Biom J 2018; 59:1204-1220. [PMID: 29139606 DOI: 10.1002/bimj.201600244] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2016] [Revised: 09/08/2017] [Accepted: 09/29/2017] [Indexed: 11/07/2022]
Abstract
In many follow-up studies different types of outcomes are collected including longitudinal measurements and time-to-event outcomes. Commonly, it is of interest to study the association between them. Joint modeling approaches of a single longitudinal outcome and survival process have recently gained increasing attention from both frequentist and Bayesian perspective. However, in many studies several longitudinal biomarkers are of interest and instead of selecting one single biomarker, the relationships between all these outcomes and their association with survival needs to be investigated. Our motivating study comes from Peritoneal Dialysis Programme in Nephrology research from Nephrology Unit, CHP (Hospital de Santo António), Porto, Portugal in which the interest relies on the possible association between various biomarkers (calcium, phosphate, parathormone, and creatinine) and the patients' survival. To this aim, we propose a two-stage model-based approach for multivariate longitudinal and survival data that allowed us to study such complex association structure. The multivariate model suggested in this paper provided new insights in the area of nephrology research showing valid results in comparison with those models studying each longitudinal biomarker with survival separately.
Collapse
Affiliation(s)
- Ipek Guler
- Center for Research in Molecular Medicine and Chronic Diseases (CiMUS), University of Santiago de Compostela, 15782, Santiago de Compostela, A Coruna, Spain
| | - Christel Faes
- I-Biostat, Hasselt University, BE3590, Diepenbeek, Belgium
| | - Carmen Cadarso-Suárez
- Center for Research in Molecular Medicine and Chronic Diseases (CiMUS), University of Santiago de Compostela, 15782, Santiago de Compostela, A Coruna, Spain
| | - Laetitia Teixeira
- Instituto de Ciencias Biomédicas Abel Salazar, Universidade do Porto, Porto, Portugal
- CINTESIS, Instituto de Ciencias Biomédicas Abel Salazar, Universidade do Porto, Porto, Portugal
| | - Anabela Rodrigues
- Instituto de Ciencias Biomédicas Abel Salazar, Universidade do Porto, Porto, Portugal
- Centro Hospitalar do Porto, Hospital Geral de Santo António, Porto, Portugal
| | - Denisa Mendonça
- Instituto de Ciencias Biomédicas Abel Salazar, Universidade do Porto, Porto, Portugal
- EPIUnit, Instituto de Saúde Pública, Universidade do Porto, Porto, Portugal
| |
Collapse
|
37
|
Zhang H, Wu L. A non‐linear model for censored and mismeasured time varying covariates in survival models, with applications in human immunodeficiency virus and acquired immune deficiency syndrome studies. J R Stat Soc Ser C Appl Stat 2018. [DOI: 10.1111/rssc.12279] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
| | - Lang Wu
- University of British Columbia Vancouver Canada
| |
Collapse
|
38
|
Saha-Chaudhuri P, Heagerty PJ. Dynamic thresholds and a summary ROC curve: Assessing prognostic accuracy of longitudinal markers. Stat Med 2018; 37:2700-2714. [PMID: 29671890 DOI: 10.1002/sim.7675] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2016] [Revised: 02/24/2018] [Accepted: 03/15/2018] [Indexed: 11/06/2022]
Abstract
Cancer patients, chronic kidney disease patients, and subjects infected with HIV are routinely monitored over time using biomarkers that represent key health status indicators. Furthermore, biomarkers are frequently used to guide initiation of new treatments or to inform changes in intervention strategies. Since key medical decisions can be made on the basis of a longitudinal biomarker, it is important to evaluate the potential accuracy associated with longitudinal monitoring. To characterize the overall accuracy of a time-dependent marker, we introduce a summary ROC curve that displays the overall sensitivity associated with a time-dependent threshold that controls time-varying specificity. The proposed statistical methods are similar to concepts considered in disease screening, yet our methods are novel in choosing a potentially time-dependent threshold to define a positive test, and our methods allow time-specific control of the false-positive rate. The proposed summary ROC curve is a natural averaging of time-dependent incident/dynamic ROC curves and therefore provides a single summary of net error rates that can be achieved in the longitudinal setting.
Collapse
Affiliation(s)
- P Saha-Chaudhuri
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Canada
| | - P J Heagerty
- Department of Biostatistics, University of Washington, Seattle, USA
| |
Collapse
|
39
|
Bhuyan P, Biswas J, Ghosh P, Das K. A Bayesian two-stage regression approach of analysing longitudinal outcomes with endogeneity and incompleteness. STAT MODEL 2018. [DOI: 10.1177/1471082x17747806] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Abstract: Two-stage regression methods are typically used for handling endogeneity in the simultaneous equations models in economics and other social sciences. However, the problem is challenging in the presence of incomplete response and/or incomplete endogenous covariate(s). We propose a Bayesian approach for the joint modelling of incomplete longitudinal continuous response and an incomplete count endogenous covariate, where the incompleteness is caused by the censorship through a selection mechanism. We define latent continuous variables which are left-censored at zero and develop a Gibbs sampling algorithm for the simultaneous estimation of the model parameters. We consider partially varying coefficients regression models containing covariates with fixed and time-varying effects on the response. Our work is motivated by a sample dataset from the Health and Retirement Study (HRS) for modelling the out-of-pocket medical cost, where the number of hospital admissions is considered as an endogenous covariate. Our analysis addresses some of the previously unanswered questions on the physical and financial health of the older population based on HRS data. Simulation studies are performed for assessing the usefulness of the proposed method compared to its competitors.
Collapse
Affiliation(s)
- Prajamitra Bhuyan
- Applied Statistics Unit, Indian Statistical Institute, Kolkata, India
| | - Jayabrata Biswas
- Interdisciplinary Statistical Research Unit, Indian Statistical Institute, Kolkata, India
| | - Pulak Ghosh
- Department of Quantitative Methods and Information Sciences, Indian Institute of Management, Bangalore, India
| | - Kiranmoy Das
- Interdisciplinary Statistical Research Unit, Indian Statistical Institute, Kolkata, India
| |
Collapse
|
40
|
Choi J, Zeng D, Olshan AF, Cai J. Joint modeling of survival time and longitudinal outcomes with flexible random effects. LIFETIME DATA ANALYSIS 2018; 24:126-152. [PMID: 28856493 PMCID: PMC5756108 DOI: 10.1007/s10985-017-9405-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2016] [Accepted: 08/17/2017] [Indexed: 06/07/2023]
Abstract
Joint models with shared Gaussian random effects have been conventionally used in analysis of longitudinal outcome and survival endpoint in biomedical or public health research. However, misspecifying the normality assumption of random effects can lead to serious bias in parameter estimation and future prediction. In this paper, we study joint models of general longitudinal outcomes and survival endpoint but allow the underlying distribution of shared random effect to be completely unknown. For inference, we propose to use a mixture of Gaussian distributions as an approximation to this unknown distribution and adopt an Expectation-Maximization (EM) algorithm for computation. Either AIC and BIC criteria are adopted for selecting the number of mixtures. We demonstrate the proposed method via a number of simulation studies. We illustrate our approach with the data from the Carolina Head and Neck Cancer Study (CHANCE).
Collapse
Affiliation(s)
- Jaeun Choi
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, 1300 Morris Park Avenue, New York, NY, 10461, USA
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, McGavran-Greenberg Hl, 135 Dauer Drive, CB 7420, Chapel Hill, NC, 27599, USA
| | - Andrew F Olshan
- Department of Epidemiology, University of North Carolina at Chapel Hill, McGavran-Greenberg Hl, 135 Dauer Drive, CB 7435, Chapel Hill, NC, 27599, USA
| | - Jianwen Cai
- Department of Biostatistics, University of North Carolina at Chapel Hill, McGavran-Greenberg Hl, 135 Dauer Drive, CB 7420, Chapel Hill, NC, 27599, USA.
| |
Collapse
|
41
|
Sweeting MJ, Barrett JK, Thompson SG, Wood AM. The use of repeated blood pressure measures for cardiovascular risk prediction: a comparison of statistical models in the ARIC study. Stat Med 2017; 36:4514-4528. [PMID: 27730661 PMCID: PMC5724484 DOI: 10.1002/sim.7144] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Revised: 09/01/2016] [Accepted: 09/18/2016] [Indexed: 01/22/2023]
Abstract
Many prediction models have been developed for the risk assessment and the prevention of cardiovascular disease in primary care. Recent efforts have focused on improving the accuracy of these prediction models by adding novel biomarkers to a common set of baseline risk predictors. Few have considered incorporating repeated measures of the common risk predictors. Through application to the Atherosclerosis Risk in Communities study and simulations, we compare models that use simple summary measures of the repeat information on systolic blood pressure, such as (i) baseline only; (ii) last observation carried forward; and (iii) cumulative mean, against more complex methods that model the repeat information using (iv) ordinary regression calibration; (v) risk-set regression calibration; and (vi) joint longitudinal and survival models. In comparison with the baseline-only model, we observed modest improvements in discrimination and calibration using the cumulative mean of systolic blood pressure, but little further improvement from any of the complex methods. © 2016 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.
Collapse
Affiliation(s)
- Michael J Sweeting
- Department of Public Health and Primary Care, School of Clinical Medicine, University of Cambridge, Cambridge, U.K
| | - Jessica K Barrett
- Department of Public Health and Primary Care, School of Clinical Medicine, University of Cambridge, Cambridge, U.K
| | - Simon G Thompson
- Department of Public Health and Primary Care, School of Clinical Medicine, University of Cambridge, Cambridge, U.K
| | - Angela M Wood
- Department of Public Health and Primary Care, School of Clinical Medicine, University of Cambridge, Cambridge, U.K
| |
Collapse
|
42
|
Zhou QM, Dai W, Zheng Y, Cai T. Robust Dynamic Risk Prediction with Longitudinal Studies. ACTA ACUST UNITED AC 2017; 1:159-170. [PMID: 29335682 DOI: 10.1080/24754269.2017.1400418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Providing accurate and dynamic age-specific risk prediction is a crucial step in precision medicine. In this manuscript, we introduce an approach for estimating the τ-year age-specific absolute risk directly via a flexible varying coefficient model. The approach facilitates the utilization of predictors varying over an individual's lifetime. By using a nonparametric inverse probability weighted kernel estimating equation, the age-specific effects of risk factors are estimated without requiring the specification of the functional form. The approach allows borrowing information across individuals of similar ages, and therefore provides a practical solution for situations where the longitudinal information is only measured sparsely. We evaluate the performance of the proposed estimation and inference procedures with numerical studies, and make comparisons with existing methods in the literature. We illustrate the performance of our proposed approach by developing a dynamic prediction model using data from the Framingham Study.
Collapse
Affiliation(s)
- Qian M Zhou
- Department of Mathematics and Statistics, Mississippi State University, Mississippi State, Mississippi, USA, 39762
| | - Wei Dai
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA, 02115
| | - Yingye Zheng
- Department of Biostatistics and Biomathematics, Fred Hutchinson Cancer Research Center, Seattle, WA, USA, 98109
| | - Tianxi Cai
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA, 02115
| |
Collapse
|
43
|
Dagne GA. Joint two-part Tobit models for longitudinal and time-to-event data. Stat Med 2017; 36:4214-4229. [PMID: 28795414 DOI: 10.1002/sim.7429] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Revised: 04/20/2017] [Accepted: 07/07/2017] [Indexed: 11/06/2022]
Abstract
In this article, we show how Tobit models can address problems of identifying characteristics of subjects having left-censored outcomes in the context of developing a method for jointly analyzing time-to-event and longitudinal data. There are some methods for handling these types of data separately, but they may not be appropriate when time to event is dependent on the longitudinal outcome, and a substantial portion of values are reported to be below the limits of detection. An alternative approach is to develop a joint model for the time-to-event outcome and a two-part longitudinal outcome, linking them through random effects. This proposed approach is implemented to assess the association between the risk of decline of CD4/CD8 ratio and rates of change in viral load, along with discriminating between patients who are potentially progressors to AIDS from patients who do not. We develop a fully Bayesian approach for fitting joint two-part Tobit models and illustrate the proposed methods on simulated and real data from an AIDS clinical study.
Collapse
Affiliation(s)
- Getachew A Dagne
- Department of Epidemiology and Biostatistics, College of Public Health, MDC 56, University of South Florida, Tampa, FL 33612, USA
| |
Collapse
|
44
|
Köhler M, Umlauf N, Beyerlein A, Winkler C, Ziegler AG, Greven S. Flexible Bayesian additive joint models with an application to type 1 diabetes research. Biom J 2017; 59:1144-1165. [PMID: 28796339 DOI: 10.1002/bimj.201600224] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2016] [Revised: 06/07/2017] [Accepted: 06/08/2017] [Indexed: 01/13/2023]
Abstract
The joint modeling of longitudinal and time-to-event data is an important tool of growing popularity to gain insights into the association between a biomarker and an event process. We develop a general framework of flexible additive joint models that allows the specification of a variety of effects, such as smooth nonlinear, time-varying and random effects, in the longitudinal and survival parts of the models. Our extensions are motivated by the investigation of the relationship between fluctuating disease-specific markers, in this case autoantibodies, and the progression to the autoimmune disease type 1 diabetes. Using Bayesian P-splines, we are in particular able to capture highly nonlinear subject-specific marker trajectories as well as a time-varying association between the marker and event process allowing new insights into disease progression. The model is estimated within a Bayesian framework and implemented in the R-package bamlss.
Collapse
Affiliation(s)
- Meike Köhler
- Institute of Diabetes Research, Helmholtz Zentrum München, and Forschergruppe Diabetes, Klinikum rechts der Isar, Technische Universität München, Neuherberg, Germany
| | - Nikolaus Umlauf
- Department of Statistics, Faculty of Economics and Statistics, Universität Innsbruck, Innsbruck, Austria
| | - Andreas Beyerlein
- Institute of Diabetes Research, Helmholtz Zentrum München, and Forschergruppe Diabetes, Klinikum rechts der Isar, Technische Universität München, Neuherberg, Germany
| | - Christiane Winkler
- Institute of Diabetes Research, Helmholtz Zentrum München, and Forschergruppe Diabetes, Klinikum rechts der Isar, Technische Universität München, Neuherberg, Germany
| | - Anette-Gabriele Ziegler
- Institute of Diabetes Research, Helmholtz Zentrum München, and Forschergruppe Diabetes, Klinikum rechts der Isar, Technische Universität München, Neuherberg, Germany.,Forschergruppe Diabetes e.V., Neuherberg, Germany
| | - Sonja Greven
- Department of Statistics, Ludwig-Maximilians-Universität München, München, Germany
| |
Collapse
|
45
|
Taber DJ, Hamedi M, Rodrigue JR, Gebregziabher MG, Srinivas TR, Baliga PK, Egede LE. Quantifying the Race Stratified Impact of Socioeconomics on Graft Outcomes in Kidney Transplant Recipients. Transplantation 2017; 100:1550-7. [PMID: 26425875 DOI: 10.1097/tp.0000000000000931] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
BACKGROUND Socioeconomic status (SES) is a significant determinant of health outcomes and may be an important component of the causal chain surrounding racial disparities in kidney transplantation. The social adaptability index (SAI) is a validated and quantifiable measure of SES, with a lack of studies analyzing this measure longitudinally or between races. METHODS Longitudinal cohort study in adult kidney transplantation transplanted at a single-center between 2005 and 2012. The SAI score includes 5 domains (employment, education, marital status, substance abuse and income), each with a minimum of 0 and maximum of 3 for an aggregate of 0 to 15 (higher score → better SES). RESULTS One thousand one hundred seventy-one patients were included; 624 (53%) were African American (AA) and 547 were non-AA. African Americans had significantly lower mean baseline SAI scores (AAs 6.5 vs non-AAs 7.8; P < 0.001). Cox regression analysis demonstrated that there was no association between baseline SAI and acute rejection in non-AAs (hazard ratio [HR], 0.92; 95% confidence interval [95% CI], 0.81-1.05), whereas it was a significant predictor of acute rejection in AAs (HR, 0.89; 95% CI, 0.80-0.99). Similarly, a 2-stage approach to joint modelling of time to graft loss and longitudinal SAI did not predict graft loss in non-AAs (HR, 1.01; 95% CI, 0.28-3.62), whereas it was a significant predictor of graft loss in AAs (HR, 0.23; 95% CI, 0.06-0.93). CONCLUSIONS After controlling for confounders, SAI scores were associated with a lower risk of acute rejection and graft loss in AA kidney transplant recipients, whereas neither baseline nor follow-up SAI predicted outcomes in non-AA kidney transplant recipients.
Collapse
Affiliation(s)
- David J Taber
- 1 Division of Transplant Surgery, College of Medicine, Medical University of South Carolina, Charleston, SC. 2 Department of Pharmacy, Ralph H Johnson VAMC, Charleston, SC. 3 College of Medicine, Medical University of South Carolina, Charleston, SC. 4 Transplant Institute, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA. 5 Department of Public Health Sciences, College of Medicine, Medical University of South Carolina, Charleston, SC. 6 Division of Transplant Nephrology, College of Medicine, Medical University of South Carolina, Charleston, SC. 7 Veterans Affairs HSR&D Health Equity and Rural Outreach Innovation Center (HEROIC), Ralph H Johnson VAMC, Charleston, SC
| | | | | | | | | | | | | |
Collapse
|
46
|
Overall Graft Loss Versus Death-Censored Graft Loss: Unmasking the Magnitude of Racial Disparities in Outcomes Among US Kidney Transplant Recipients. Transplantation 2017; 101:402-410. [PMID: 26901080 DOI: 10.1097/tp.0000000000001119] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
BACKGROUND Black kidney transplant recipients experience disproportionately high rates of graft loss. This disparity has persisted for 40 years, and improvements may be impeded based on the current public reporting of overall graft loss by US regulatory organizations for transplantation. METHODS Longitudinal cohort study of kidney transplant recipients using a data set created by linking Veterans Affairs and US Renal Data System information, including 4918 veterans transplanted between January 2001 and December 2007, with follow-up through December 2010. Multivariable analysis was conducted using 2-stage joint modeling of random and fixed effects of longitudinal data (linear mixed model) with time to event outcomes (Cox regression). RESULTS Three thousand three hundred six non-Hispanic whites (67%) were compared with 1612 non-Hispanic black (33%) recipients with 6.0 ± 2.2 years of follow-up. In the unadjusted analysis, black recipients were significantly more likely to have overall graft loss (hazard ratio [HR], 1.19; 95% confidence interval [95% CI], 1.07-1.33), death-censored graft loss (HR, 1.67; 95% CI, 1.45-1.92), and lower mortality (HR, 0.83; 95% CI, 0.72-0.96). In fully adjusted models, only death-censored graft loss remained significant (HR, 1.38; 95% CI, 1.12-1.71; overall graft loss [HR, 1.08; 95% CI, 0.91-1.28]; mortality [HR, 0.84; 95% CI, 0.67-1.06]). A composite definition of graft loss reduced the magnitude of disparities in blacks by 22%. CONCLUSIONS Non-Hispanic black kidney transplant recipients experience a substantial disparity in graft loss, but not mortality. This study of US data provides evidence to suggest that researchers should focus on using death-censored graft loss as the primary outcome of interest to facilitate a better understanding of racial disparities in kidney transplantation.
Collapse
|
47
|
Yuen HP, Mackinnon A. Performance of joint modelling of time-to-event data with time-dependent predictors: an assessment based on transition to psychosis data. PeerJ 2016; 4:e2582. [PMID: 27781169 PMCID: PMC5075698 DOI: 10.7717/peerj.2582] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Accepted: 09/18/2016] [Indexed: 11/28/2022] Open
Abstract
Joint modelling has emerged to be a potential tool to analyse data with a time-to-event outcome and longitudinal measurements collected over a series of time points. Joint modelling involves the simultaneous modelling of the two components, namely the time-to-event component and the longitudinal component. The main challenges of joint modelling are the mathematical and computational complexity. Recent advances in joint modelling have seen the emergence of several software packages which have implemented some of the computational requirements to run joint models. These packages have opened the door for more routine use of joint modelling. Through simulations and real data based on transition to psychosis research, we compared joint model analysis of time-to-event outcome with the conventional Cox regression analysis. We also compared a number of packages for fitting joint models. Our results suggest that joint modelling do have advantages over conventional analysis despite its potential complexity. Our results also suggest that the results of analyses may depend on how the methodology is implemented.
Collapse
Affiliation(s)
- Hok Pan Yuen
- Orygen, The National Centre of Excellence in Youth Mental Health, Parkville, Victoria, Australia; Centre for Youth Mental Health, The University of Melbourne, Parkville, Victoria, Australia
| | - Andrew Mackinnon
- Centre for Mental Health, Melbourne School of Population and Global Health, The University of Melbourne, Parkville, Victoria, Australia; Black Dog Institute and University of New South Wales, Sydney, New South Wales, Australia
| |
Collapse
|
48
|
Murphy TE, Allore HG, Han L, Peduzzi PN, Gill TM, Xu X, Lin H. A longitudinal, observational study with many repeated measures demonstrated improved precision of individual survival curves using Bayesian joint modeling of disability and survival. Exp Aging Res 2016; 41:221-39. [PMID: 25978444 DOI: 10.1080/0361073x.2015.1021640] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
UNLABELLED BACKGROUND/STUDY CONTEXT: It has not been previously demonstrated whether Bayesian joint modeling (BJM) of disability and survival can, under certain conditions, improve precision of individual survival curves. METHODS A longitudinal, observational study wherein 754 initially nondisabled community-dwelling adults in greater New Haven, Connecticut, were observed on a monthly basis for over 10 years. RESULTS In this study, BJM exploited many monthly observations to demonstrate, relative to a separate survival model with adjustment, improved precision of individual survival curves, permitting detection of significant differences between survival curves of two similar individuals. The gain in precision was lost when using only those observations from intervals of 6, 9, or 12 months. CONCLUSION When there are many repeated measures, BJM of longitudinal functional disability and interval-censored survival can potentially increase the precision of individual survival curves relative to those from a separate survival model. This may facilitate the identification of significant differences between individual survival curves, a useful result usually precluded by the large variability inherent to individual-level estimates from stand-alone survival models.
Collapse
Affiliation(s)
- Terrence E Murphy
- a Department of Internal Medicine , Yale School of Medicine , New Haven , Connecticut , USA
| | | | | | | | | | | | | |
Collapse
|
49
|
Zhao X, Zhou X. Semiparametric models of longitudinal and time-to-event data with applications to HIV viral dynamics and CD4 counts. J Appl Stat 2015. [DOI: 10.1080/02664763.2015.1043859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
50
|
Lawrence Gould A, Boye ME, Crowther MJ, Ibrahim JG, Quartey G, Micallef S, Bois FY. Joint modeling of survival and longitudinal non-survival data: current methods and issues. Report of the DIA Bayesian joint modeling working group. Stat Med 2015; 34:2181-95. [PMID: 24634327 PMCID: PMC4677775 DOI: 10.1002/sim.6141] [Citation(s) in RCA: 87] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2014] [Accepted: 02/19/2014] [Indexed: 12/25/2022]
Abstract
Explicitly modeling underlying relationships between a survival endpoint and processes that generate longitudinal measured or reported outcomes potentially could improve the efficiency of clinical trials and provide greater insight into the various dimensions of the clinical effect of interventions included in the trials. Various strategies have been proposed for using longitudinal findings to elucidate intervention effects on clinical outcomes such as survival. The application of specifically Bayesian approaches for constructing models that address longitudinal and survival outcomes explicitly has been recently addressed in the literature. We review currently available methods for carrying out joint analyses, including issues of implementation and interpretation, identify software tools that can be used to carry out the necessary calculations, and review applications of the methodology.
Collapse
Affiliation(s)
- A Lawrence Gould
- Merck Research Laboratories, 351 North Sumneytown Pike, North Wales, PA 19454, U.S.A
| | - Mark Ernest Boye
- Eli Lilly, 893 S. Delaware Street, Indianapolis, IN 46285, U.S.A
| | - Michael J Crowther
- Department of Health Sciences, University of Leicester, Adrian Building, University Road, Leicester LE1 7RH, U.K
| | - Joseph G Ibrahim
- Department of Statistics and Operations Research, University of North Carolina, 318 Hanes Hall Chapel Hill, NC 27599, U.S.A
| | | | | | - Frederic Y Bois
- Université de Technologie de Compiègne, Centre de Recherche de Royallieu, 60205 Compiègne Cedex, France
- INERIS/CRD/VIVA/METO, Verneuil en Halatte, France
| |
Collapse
|