1
|
Pan C, Cai B, Sui X. A Bayesian proportional hazards mixture cure model for interval-censored data. LIFETIME DATA ANALYSIS 2024; 30:327-344. [PMID: 38015378 DOI: 10.1007/s10985-023-09613-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 10/12/2023] [Indexed: 11/29/2023]
Abstract
The proportional hazards mixture cure model is a popular analysis method for survival data where a subgroup of patients are cured. When the data are interval-censored, the estimation of this model is challenging due to its complex data structure. In this article, we propose a computationally efficient semiparametric Bayesian approach, facilitated by spline approximation and Poisson data augmentation, for model estimation and inference with interval-censored data and a cure rate. The spline approximation and Poisson data augmentation greatly simplify the MCMC algorithm and enhance the convergence of the MCMC chains. The empirical properties of the proposed method are examined through extensive simulation studies and also compared with the R package "GORCure". The use of the proposed method is illustrated through analyzing a data set from the Aerobics Center Longitudinal Study.
Collapse
Affiliation(s)
- Chun Pan
- Department of Mathematics and Statistics, Hunter College, New York, NY, 10065, USA.
| | - Bo Cai
- Department of Epidemiology and Biostatistics, University of South Carolina, Columbia, SC, 29208, USA
| | - Xuemei Sui
- Department of Epidemiology and Biostatistics, University of South Carolina, Columbia, SC, 29208, USA
| |
Collapse
|
2
|
Hu X, Su W, Ye Z, Zhao X. Conditional modeling of panel count data with partly interval-censored failure event. Biometrics 2024; 80:ujae020. [PMID: 38497823 DOI: 10.1093/biomtc/ujae020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Revised: 02/19/2024] [Accepted: 02/28/2024] [Indexed: 03/19/2024]
Abstract
In longitudinal follow-up studies, panel count data arise from discrete observations on recurrent events. We investigate a more general situation where a partly interval-censored failure event is informative to recurrent events. The existing methods for the informative failure event are based on the latent variable model, which provides indirect interpretation for the effect of failure event. To solve this problem, we propose a failure-time-dependent proportional mean model with panel count data through an unspecified link function. For estimation of model parameters, we consider a conditional expectation of least squares function to overcome the challenges from partly interval-censoring, and develop a two-stage estimation procedure by treating the distribution function of the failure time as a functional nuisance parameter and using the B-spline functions to approximate unknown baseline mean and link functions. Furthermore, we derive the overall convergence rate of the proposed estimators and establish the asymptotic normality of finite-dimensional estimator and functionals of infinite-dimensional estimator. The proposed estimation procedure is evaluated by extensive simulation studies, in which the finite-sample performances coincide with the theoretical results. We further illustrate our method with a longitudinal healthy longevity study and draw some insightful conclusions.
Collapse
Affiliation(s)
- Xiangbin Hu
- Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong
| | - Wen Su
- Department of Biostatistics, City University of Hong Kong, Hong Kong
| | - Zhisheng Ye
- Department of Industrial Systems Engineering and Management, National University of Singapore, Singapore
| | - Xingqiu Zhao
- Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong
| |
Collapse
|
3
|
Chen LP, Qiu B. Analysis of length-biased and partly interval-censored survival data with mismeasured covariates. Biometrics 2023; 79:3929-3940. [PMID: 37458679 DOI: 10.1111/biom.13898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Accepted: 06/22/2023] [Indexed: 12/21/2023]
Abstract
In this paper, we analyze the length-biased and partly interval-censored data, whose challenges primarily come from biased sampling and interfere induced by interval censoring. Unlike existing methods that focus on low-dimensional data and assume the covariates to be precisely measured, sometimes researchers may encounter high-dimensional data subject to measurement error, which are ubiquitous in applications and make estimation unreliable. To address those challenges, we explore a valid inference method for handling high-dimensional length-biased and interval-censored survival data with measurement error in covariates under the accelerated failure time model. We primarily employ the SIMEX method to correct for measurement error effects and propose the boosting procedure to do variable selection and estimation. The proposed method is able to handle the case that the dimension of covariates is larger than the sample size and enjoys appealing features that the distributions of the covariates are left unspecified.
Collapse
Affiliation(s)
- Li-Pang Chen
- Department of Statistics, National Chengchi University, Taipei, Taiwan
| | - Bangxu Qiu
- Department of Statistics, National Chengchi University, Taipei, Taiwan
| |
Collapse
|
4
|
Qiu M, Hu T. Bayesian transformation model for spatial partly interval-censored data. J Appl Stat 2023; 51:2139-2156. [PMID: 39157272 PMCID: PMC11328804 DOI: 10.1080/02664763.2023.2263819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/20/2024]
Abstract
The transformation model with partly interval-censored data offers a highly flexible modeling framework that can simultaneously support multiple common survival models and a wide variety of censored data types. However, the real data may contain unexplained heterogeneity that cannot be entirely explained by covariates and may be brought on by a variety of unmeasured regional characteristics. Due to this, we introduce the conditionally autoregressive prior into the transformation model with partly interval-censored data and take the spatial frailty into account. An efficient Markov chain Monte Carlo method is proposed to handle the posterior sampling and model inference. The approach is simple to use and does not include any challenging Metropolis steps owing to four-stage data augmentation. Through several simulations, the suggested method's empirical performance is assessed and then the method is used in a leukemia study.
Collapse
Affiliation(s)
- Mingyue Qiu
- School of Mathematical Sciences, Capital Normal University, Beijing, People's Republic of China
| | - Tao Hu
- School of Mathematical Sciences, Capital Normal University, Beijing, People's Republic of China
| |
Collapse
|
5
|
Shao L, Li H, Li S, Sun J. A pairwise pseudo-likelihood approach for regression analysis of left-truncated failure time data with various types of censoring. BMC Med Res Methodol 2023; 23:82. [PMID: 37016341 PMCID: PMC10071649 DOI: 10.1186/s12874-023-01903-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 03/26/2023] [Indexed: 04/06/2023] Open
Abstract
BACKGROUND Failure time data frequently occur in many medical studies and often accompany with various types of censoring. In some applications, left truncation may occur and can induce biased sampling, which makes the practical data analysis become more complicated. The existing analysis methods for left-truncated data have some limitations in that they either focus only on a special type of censored data or fail to flexibly utilize the distribution information of the truncation times for inference. Therefore, it is essential to develop a reliable and efficient method for the analysis of left-truncated failure time data with various types of censoring. METHOD This paper concerns regression analysis of left-truncated failure time data with the proportional hazards model under various types of censoring mechanisms, including right censoring, interval censoring and a mixture of them. The proposed pairwise pseudo-likelihood estimation method is essentially built on a combination of the conditional likelihood and the pairwise likelihood that eliminates the nuisance truncation distribution function or avoids its estimation. To implement the presented method, a flexible EM algorithm is developed by utilizing the idea of self-consistent estimating equation. A main feature of the algorithm is that it involves closed-form estimators of the large-dimensional nuisance parameters and is thus computationally stable and reliable. In addition, an R package LTsurv is developed. RESULTS The numerical results obtained from extensive simulation studies suggest that the proposed pairwise pseudo-likelihood method performs reasonably well in practical situations and is obviously more efficient than the conditional likelihood approach as expected. The analysis results of the MHCPS data with the proposed pairwise pseudo-likelihood method indicate that males have significantly higher risk of losing active life than females. In contrast, the conditional likelihood method recognizes this effect as non-significant, which is because the conditional likelihood method often loses some estimation efficiency compared with the proposed method. CONCLUSIONS The proposed method provides a general and helpful tool to conduct the Cox's regression analysis of left-truncated failure time data under various types of censoring.
Collapse
Affiliation(s)
- Li Shao
- School of Economics and Statistics, Guangzhou University, Guangzhou, China
| | - Hongxi Li
- School of Economics and Statistics, Guangzhou University, Guangzhou, China.
| | - Shuwei Li
- School of Economics and Statistics, Guangzhou University, Guangzhou, China
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, Missouri, USA
| |
Collapse
|
6
|
Withana Gamage PW, McMahan CS, Wang L. A flexible parametric approach for analyzing arbitrarily censored data that are potentially subject to left truncation under the proportional hazards model. LIFETIME DATA ANALYSIS 2023; 29:188-212. [PMID: 36208362 PMCID: PMC9852023 DOI: 10.1007/s10985-022-09579-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 09/23/2022] [Indexed: 06/16/2023]
Abstract
The proportional hazards (PH) model is, arguably, the most popular model for the analysis of lifetime data arising from epidemiological studies, among many others. In such applications, analysts may be faced with censored outcomes and/or studies which institute enrollment criterion leading to left truncation. Censored outcomes arise when the event of interest is not observed but rather is known relevant to an observation time(s). Left truncated data occur in studies that exclude participants who have experienced the event prior to being enrolled in the study. If not accounted for, both of these features can lead to inaccurate inferences about the population under study. Thus, to overcome this challenge, herein we propose a novel unified PH model that can be used to accommodate both of these features. In particular, our approach can seamlessly analyze exactly observed failure times along with interval-censored observations, while aptly accounting for left truncation. To facilitate model fitting, an expectation-maximization algorithm is developed through the introduction of carefully structured latent random variables. To provide modeling flexibility, a monotone spline representation is used to approximate the cumulative baseline hazard function. The performance of our methodology is evaluated through a simulation study and is further illustrated through the analysis of two motivating data sets; one that involves child mortality in Nigeria and the other prostate cancer.
Collapse
Affiliation(s)
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC, 29634, USA
| | - Lianming Wang
- Department of Statistics, University of South Carolina, Columbia, SC, 29208, USA
| |
Collapse
|
7
|
Abstract
Partly interval-censored data often occur in cancer clinical trials and have been analyzed as right-censored data. Patients' geographic information sometimes is also available and can be useful in testing treatment effects and predicting survivorship. We propose a Bayesian semiparametric method for analyzing partly interval-censored data with areal spatial information under the proportional hazards model. A simulation study is conducted to compare the performance of the proposed method with the main method currently available in the literature and the traditional Cox proportional hazards model for right-censored data. The method is illustrated through a leukemia survival data set and a dental health data set. The proposed method will be especially useful for analyzing progression-free survival in multi-regional cancer clinical trials.
Collapse
Affiliation(s)
- Chun Pan
- Department of Mathematics and Statistics, Hunter College, New York, NY 10065,,
| | - Bo Cai
- Department of Epidemiology and Biostatistics, University of South Carolina, Columbia, SC 29208
| |
Collapse
|
8
|
Wang C, Jiang J, Song X. Bayesian transformation models with partly interval-censored data. Stat Med 2021; 41:1263-1279. [PMID: 34845732 DOI: 10.1002/sim.9271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 10/27/2021] [Accepted: 11/04/2021] [Indexed: 11/07/2022]
Abstract
In many scientific fields, partly interval-censored data, which consist of exactly observed and interval-censored observations on the failure time of interest, appear frequently. However, methodological developments in the analysis of partly interval-censored data are relatively limited and have mainly focused on additive or proportional hazards models. The general linear transformation model provides a highly flexible modeling framework that includes several familiar survival models as special cases. Despite such nice features, the inference procedure for this class of models has not been developed for partly interval-censored data. We propose a fully Bayesian approach coped with efficient Markov chain Monte Carlo methods to fill this gap. A four-stage data augmentation procedure is introduced to tackle the challenges presented by the complex model and data structure. The proposed method is easy to implement and computationally attractive. The empirical performance of the proposed method is evaluated through two simulation studies, and the model is then applied to a dental health study.
Collapse
Affiliation(s)
- Chunjie Wang
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, China
| | - Jingjing Jiang
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, China
| | - Xinyuan Song
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
9
|
Sun L, Li S, Wang L, Song X. A semiparametric mixture model approach for regression analysis of partly interval-censored data with a cured subgroup. Stat Methods Med Res 2021; 30:1890-1903. [PMID: 34197261 DOI: 10.1177/09622802211023985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Failure time data with a cured subgroup are frequently confronted in various scientific fields and many methods have been proposed for their analysis under right or interval censoring. However, a cure model approach does not seem to exist in the analysis of partly interval-censored data, which consist of both exactly observed and interval-censored observations on the failure time of interest. In this article, we propose a two-component mixture cure model approach for analyzing such type of data. We employ a logistic model to describe the cured probability and a proportional hazards model to model the latent failure time distribution for uncured subjects. We consider maximum likelihood estimation and develop a new expectation-maximization algorithm for its implementation. The asymptotic properties of the resulting estimators are established and the finite sample performance of the proposed method is examined through simulation studies. An application to a set of real data on childhood mortality in Nigeria is provided.
Collapse
Affiliation(s)
- Liuquan Sun
- School of Economics and Statistics, Guangzhou University, Guangzhou, China.,Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Shuwei Li
- School of Economics and Statistics, Guangzhou University, Guangzhou, China
| | - Lianming Wang
- Department of Statistics, University of South Carolina, Columbia, USA
| | - Xinyuan Song
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong
| |
Collapse
|
10
|
Wang L, Wang L. Regression analysis of arbitrarily censored survival data under the proportional odds model. Stat Med 2021; 40:3724-3739. [PMID: 33882618 DOI: 10.1002/sim.8994] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 02/13/2021] [Accepted: 03/29/2021] [Indexed: 11/09/2022]
Abstract
Arbitrarily censored data are referred to as the survival data that contain a mixture of exactly observed, left-censored, interval-censored, and right-censored observations. Existing research work on regression analysis on arbitrarily censored data is relatively sparse and mainly focused on the proportional hazards model and the accelerated failure time model. This article studies the proportional odds (PO) model and proposes a novel estimation approach through an expectation-maximization (EM) algorithm for analyzing such data. The proposed EM algorithm has many appealing properties such as being robust to initial values, easy to implement, converging fast, and providing the variance estimate of the regression parameter estimate in closed form. An informal diagnosis plot is developed for checking the PO model assumption. Our method has shown excellent performance in estimating the regression parameters as well as the baseline survival function in a simulation study. A real-life dataset about metastatic colorectal cancer is analyzed for illustration. An R package regPO has been created for practitioners to implement our method.
Collapse
Affiliation(s)
- Lu Wang
- Department of Mathematics, Western New England University, Springfield, Massachusetts, USA
| | - Lianming Wang
- Department of Statistics, University of South Carolina, Columbia, South Carolina, USA
| |
Collapse
|