1
|
Kürüm E, Nguyen DV, Qian Q, Banerjee S, Rhee CM, Şentürk D. Spatiotemporal multilevel joint modeling of longitudinal and survival outcomes in end-stage kidney disease. LIFETIME DATA ANALYSIS 2024:10.1007/s10985-024-09635-w. [PMID: 39367291 DOI: 10.1007/s10985-024-09635-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 09/06/2024] [Indexed: 10/06/2024]
Abstract
Individuals with end-stage kidney disease (ESKD) on dialysis experience high mortality and excessive burden of hospitalizations over time relative to comparable Medicare patient cohorts without kidney failure. A key interest in this population is to understand the time-dynamic effects of multilevel risk factors that contribute to the correlated outcomes of longitudinal hospitalization and mortality. For this we utilize multilevel data from the United States Renal Data System (USRDS), a national database that includes nearly all patients with ESKD, where repeated measurements/hospitalizations over time are nested in patients and patients are nested within (health service) regions across the contiguous U.S. We develop a novel spatiotemporal multilevel joint model (STM-JM) that accounts for the aforementioned hierarchical structure of the data while considering the spatiotemporal variations in both outcomes across regions. The proposed STM-JM includes time-varying effects of multilevel (patient- and region-level) risk factors on hospitalization trajectories and mortality and incorporates spatial correlations across the spatial regions via a multivariate conditional autoregressive correlation structure. Efficient estimation and inference are performed via a Bayesian framework, where multilevel varying coefficient functions are targeted via thin-plate splines. The finite sample performance of the proposed method is assessed through simulation studies. An application of the proposed method to the USRDS data highlights significant time-varying effects of patient- and region-level risk factors on hospitalization and mortality and identifies specific time periods on dialysis and spatial locations across the U.S. with elevated hospitalization and mortality risks.
Collapse
Affiliation(s)
- Esra Kürüm
- Department of Statistics, University of California, Riverside, CA, 92521, USA.
| | - Danh V Nguyen
- Department of Medicine, University of California Irvine, Orange, CA, 92868, USA
| | - Qi Qian
- Department of Biostatistics, University of California, Los Angeles, CA, 90095, USA
| | - Sudipto Banerjee
- Department of Biostatistics, University of California, Los Angeles, CA, 90095, USA
| | - Connie M Rhee
- Department of Medicine, University of California, Los Angeles, CA, 90095, USA
- Nephrology Section, VA Greater Los Angeles Health Care System, Los Angeles, CA, 90073, USA
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, CA, 90095, USA
| |
Collapse
|
2
|
Kürüm E, Nguyen DV, Li Y, Rhee CM, Kalantar‐Zadeh K, Şentürk D. Multilevel joint modeling of hospitalization and survival in patients on dialysis. Stat (Int Stat Inst) 2021. [DOI: 10.1002/sta4.356] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Esra Kürüm
- Department of Statistics University of California Riverside 92521 California USA
| | - Danh V. Nguyen
- Department of Medicine University of California Irvine Orange 92868 California USA
| | - Yihao Li
- Department of Biostatistics University of California Los Angeles 90095 California USA
| | - Connie M. Rhee
- Department of Medicine University of California Irvine Orange 92868 California USA
- Harold Simmons Center for Chronic Disease Research and Epidemiology University of California Irvine School of Medicine Orange 92868 California USA
| | - Kamyar Kalantar‐Zadeh
- Department of Medicine University of California Irvine Orange 92868 California USA
- Harold Simmons Center for Chronic Disease Research and Epidemiology University of California Irvine School of Medicine Orange 92868 California USA
| | - Damla Şentürk
- Department of Biostatistics University of California Los Angeles 90095 California USA
| |
Collapse
|
3
|
Campos LF, Şentürk D, Chen Y, Nguyen DV. Bias and estimation under misspecification of the risk period in self-controlled case series studies. Stat (Int Stat Inst) 2017; 6:373-389. [PMID: 30473787 DOI: 10.1002/sta4.166] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The self-controlled case series (SCCS) method is useful for estimating the relative incidence (RI) of acute events, such as adverse events (AEs) during a specified risk period following an exposure (e.g., 6-week period after vaccinations or 30-day period after infection-related hospitalizations). In practice, the "optimal" risk period is unknown and must be specified. To date, two approaches are available to guide the specification of the risk period. Both methods do not fully utilize the nature of the bias due to misspecification, which to date has not been characterized. Thus, we elucidate the bias of SCCS estimate of the RI when the risk period is misspecified. We then propose a novel method that more effectively estimates the optimal risk period and the associated RI of AEs. The new method incorporates information on the functional form of the bias. Efficacy of the proposed approach is illustrated with substantial reduction in bias and variance in simulation studies. The proposed method is illustrated with two SCCS studies to determine the (1) risk of idiopathic thrombocytopenic purpura after measles-mumps-rubella vaccination in children and (2) risk of cardiovascular events after infection-related hospitalizations in older patients on dialysis.
Collapse
Affiliation(s)
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, CA 90095, USA
| | - Yanjun Chen
- Institute for Clinical and Translational Science, Irvine, CA 92617, USA
| | - Danh V Nguyen
- Department of Medicine, University of California Irvine, Orange, CA 92868, USA
| |
Collapse
|
4
|
Estes JP, Nguyen DV, Dalrymple LS, Mu Y, Şentürk D. Time-varying effect modeling with longitudinal data truncated by death: conditional models, interpretations, and inference. Stat Med 2015; 35:1834-47. [PMID: 26646582 DOI: 10.1002/sim.6836] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2015] [Accepted: 11/14/2015] [Indexed: 11/07/2022]
Abstract
Recent studies found that infection-related hospitalization was associated with increased risk of cardiovascular (CV) events, such as myocardial infarction and stroke in the dialysis population. In this work, we develop time-varying effects modeling tools in order to examine the CV outcome risk trajectories during the time periods before and after an initial infection-related hospitalization. For this, we propose partly conditional and fully conditional partially linear generalized varying coefficient models (PL-GVCMs) for modeling time-varying effects in longitudinal data with substantial follow-up truncation by death. Unconditional models that implicitly target an immortal population is not a relevant target of inference in applications involving a population with high mortality, like the dialysis population. A partly conditional model characterizes the outcome trajectory for the dynamic cohort of survivors, where each point in the longitudinal trajectory represents a snapshot of the population relationships among subjects who are alive at that time point. In contrast, a fully conditional approach models the time-varying effects of the population stratified by the actual time of death, where the mean response characterizes individual trends in each cohort stratum. We compare and contrast partly and fully conditional PL-GVCMs in our aforementioned application using hospitalization data from the United States Renal Data System. For inference, we develop generalized likelihood ratio tests. Simulation studies examine the efficacy of estimation and inference procedures.
Collapse
Affiliation(s)
- Jason P Estes
- Department of Biostatistics, University of California, Los Angeles, 90095, California, U.S.A
| | - Danh V Nguyen
- Department of Medicine, UC Irvine School of Medicine, Orange, 92868-3298, California, U.S.A.,Institute for Clinical and Translational Science, University of California, Irvine, 92687-1385, California, U.S.A
| | - Lorien S Dalrymple
- Division of Nephrology, Department of Medicine, University of California, Sacramento, 95817, California, U.S.A
| | - Yi Mu
- Graduate Group in Epidemiology, University of California, Davis, 95616, California, U.S.A
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, 90095, California, U.S.A.,Department of Statistics, University of California, Los Angeles, 90095, California, U.S.A
| |
Collapse
|
5
|
Sentürk D, Dalrymple LS, Mu Y, Nguyen DV. Weighted hurdle regression method for joint modeling of cardiovascular events likelihood and rate in the US dialysis population. Stat Med 2014; 33:4387-401. [PMID: 24930810 DOI: 10.1002/sim.6232] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Revised: 02/08/2014] [Accepted: 05/22/2014] [Indexed: 11/08/2022]
Abstract
We propose a new weighted hurdle regression method for modeling count data, with particular interest in modeling cardiovascular events in patients on dialysis. Cardiovascular disease remains one of the leading causes of hospitalization and death in this population. Our aim is to jointly model the relationship/association between covariates and (i) the probability of cardiovascular events, a binary process, and (ii) the rate of events once the realization is positive-when the 'hurdle' is crossed-using a zero-truncated Poisson distribution. When the observation period or follow-up time, from the start of dialysis, varies among individuals, the estimated probability of positive cardiovascular events during the study period will be biased. Furthermore, when the model contains covariates, then the estimated relationship between the covariates and the probability of cardiovascular events will also be biased. These challenges are addressed with the proposed weighted hurdle regression method. Estimation for the weighted hurdle regression model is a weighted likelihood approach, where standard maximum likelihood estimation can be utilized. The method is illustrated with data from the United States Renal Data System. Simulation studies show the ability of proposed method to successfully adjust for differential follow-up times and incorporate the effects of covariates in the weighting.
Collapse
Affiliation(s)
- Damla Sentürk
- Department of Biostatistics, University of California, Los Angeles, CA 90095, U.S.A
| | | | | | | |
Collapse
|
6
|
Estes JP, Nguyen DV, Dalrymple LS, Mu Y, Şentürk D. Cardiovascular event risk dynamics over time in older patients on dialysis: a generalized multiple-index varying coefficient model approach. Biometrics 2014; 70:754-64. [PMID: 24766178 PMCID: PMC4209204 DOI: 10.1111/biom.12176] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2013] [Revised: 02/01/2014] [Accepted: 03/01/2014] [Indexed: 11/29/2022]
Abstract
Among patients on dialysis, cardiovascular disease and infection are leading causes of hospitalization and death. Although recent studies have found that the risk of cardiovascular events is higher after an infection-related hospitalization, studies have not fully elucidated how the risk of cardiovascular events changes over time for patients on dialysis. In this work, we characterize the dynamics of cardiovascular event risk trajectories for patients on dialysis while conditioning on survival status via multiple time indices: (1) time since the start of dialysis, (2) time since the pivotal initial infection-related hospitalization, and (3) the patient's age at the start of dialysis. This is achieved by using a new class of generalized multiple-index varying coefficient (GM-IVC) models. The proposed GM-IVC models utilize a multiplicative structure and one-dimensional varying coefficient functions along each time and age index to capture the cardiovascular risk dynamics before and after the initial infection-related hospitalization among the dynamic cohort of survivors. We develop a two-step estimation procedure for the GM-IVC models based on local maximum likelihood. We report new insights on the dynamics of cardiovascular events risk using the United States Renal Data System database, which collects data on nearly all patients with end-stage renal disease in the United States. Finally, simulation studies assess the performance of the proposed estimation procedures.
Collapse
Affiliation(s)
- Jason P. Estes
- Department of Biostatistics, University of California, Los Angeles, California 90095, U.S.A
| | - Danh V. Nguyen
- Department of Medicine, University of California, Irvine, California 92868, U.S.A
- Institute for Clinical and Translational Science, University of California, Irvine, California 92687, U.S.A
| | - Lorien S. Dalrymple
- Division of Nephrology, Department of Medicine, University of California, Sacramento, California 95817, U.S.A
| | - Yi Mu
- Graduate Group in Epidemiology, University of California, Davis, California 95616, U.S.A
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, California 90095, U.S.A
| |
Collapse
|
7
|
Mohammed SM, Dalrymple LS, Şentürk D, Nguyen DV. Naive hypothesis testing for case series analysis with time-varying exposure onset measurement error: inference for infection-cardiovascular risk in patients on dialysis. Biometrics 2013; 69:520-9. [PMID: 23731166 DOI: 10.1111/biom.12033] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2012] [Revised: 12/01/2012] [Accepted: 01/01/2013] [Indexed: 11/26/2022]
Abstract
The case series method is useful in studying the relationship between time-varying exposures, such as infections, and acute events observed during the observation periods of individuals. It provides estimates of the relative incidences of events in risk periods (e.g., 30-day period after infections) relative to the baseline periods. When the times of exposure onsets are not known precisely, application of the case series model ignoring exposure onset measurement error leads to biased estimates. Bias-correction is necessary in order to understand the true directions and effect sizes associated with exposure risk periods, although uncorrected estimators have smaller variance. Thus, inference via hypothesis testing based on uncorrected test statistics, if valid, is potentially more powerful. Furthermore, the tests can be implemented in standard software and do not require additional auxiliary data. In this work, we examine the validity and power of naive hypothesis testing, based on applying the case series analysis to the imprecise data without correcting for the error. Based on simulation studies and theoretical calculations, we determine the validity and relative power of common hypothesis tests of interest in case series analysis. In particular, we illustrate that the tests for the global null hypothesis, the overall null hypotheses associated with all risk periods or all age effects are valid. However, tests of individual risk period parameters are not generally valid. Practical guidelines are provided and illustrated with data from patients on dialysis.
Collapse
Affiliation(s)
- Sandra M Mohammed
- Division of Biostatistics, Department of Public Health Sciences, University of California, Davis, CA 95616, USA
| | | | | | | |
Collapse
|
8
|
Mohammed SM, Dalrymple LS, Şentürk D, Nguyen DV. Design considerations for case series models with exposure onset measurement error. Stat Med 2013; 32:772-86. [DOI: 10.1002/sim.5552] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2012] [Accepted: 07/03/2012] [Indexed: 11/10/2022]
Affiliation(s)
- Sandra M. Mohammed
- Division of Biostatistics, Department of Public Health Sciences; University of California; Davis CA 95616 U.S.A
| | - Lorien S. Dalrymple
- Division of Nephrology, Department of Medicine; University of California; Sacramento CA 95691 U.S.A
| | - Damla Şentürk
- Department of Biostatistics; University of California; Los Angeles CA 90095 U.S.A
| | - Danh V. Nguyen
- Division of Biostatistics, Department of Public Health Sciences; University of California; Davis CA 95616 U.S.A
| |
Collapse
|