1
|
González-Manteiga W, Martínez-Miranda MD, Van Keilegom I. Goodness-of-fit tests in proportional hazards models with random effects. Biom J 2023; 65:e2000353. [PMID: 35790474 PMCID: PMC10083947 DOI: 10.1002/bimj.202000353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 12/23/2021] [Accepted: 02/20/2022] [Indexed: 01/17/2023]
Abstract
This paper deals with testing the functional form of the covariate effects in a Cox proportional hazards model with random effects. We assume that the responses are clustered and incomplete due to right censoring. The estimation of the model under the null (parametric covariate effect) and the alternative (nonparametric effect) is performed using the full marginal likelihood. Under the alternative, the nonparametric covariate effects are estimated using orthogonal expansions. The test statistic is the likelihood ratio statistic, and its distribution is approximated using a bootstrap method. The performance of the proposed testing procedure is studied through simulations. The method is also applied on two real data sets one from biomedical research and one from veterinary medicine.
Collapse
Affiliation(s)
- Wenceslao González-Manteiga
- Department of Statistics, Mathematical Analysis and Operational Research, University of Santiago de Compostela, Santiago de Compostela, Spain
| | | | | |
Collapse
|
2
|
Han B, Van Keilegom I, Wang X. Semiparametric estimation of the nonmixture cure model with auxiliary survival information. Biometrics 2021; 78:448-459. [PMID: 33721326 DOI: 10.1111/biom.13450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 01/06/2021] [Accepted: 02/15/2021] [Indexed: 12/01/2022]
Abstract
With rapidly increasing data sources, statistical methods that make use of external information are gradually becoming popular tools in medical research. In this article, we efficiently synthesize the auxiliary survival information and propose a semiparametric estimation method for the combined empirical likelihood in the framework of the nonmixture cure model, to enhance inference about the associations between exposures and disease outcomes. The auxiliary survival probabilities from external sources are first summarized as unbiased estimation equations, which help produce more efficient estimates of the effects of interest and improve the prediction accuracy for the risk of the event. Then we develop a Bernstein-based sieve empirical likelihood method to estimate the parametric and nonparametric components simultaneously. Such an estimation procedure allows us to reduce the computation burden while preserving the shape constraint on the baseline distribution function. The resulting estimators for the true associations are strongly consistent and asymptotically normal. Instead of collecting substantial exposure data, the auxiliary survival information at multiple time points is incorporated, which further reduces the mean squared error of the estimators. This contributes to biomarker evaluation and treatment effect analysis within smaller studies. We show how to choose the number of auxiliary survival probabilities appropriately and provide a guideline for practical applications. Simulation studies demonstrate that the estimators enjoy large gains in efficiency. A melanoma dataset is analyzed for illustrating the methodology.
Collapse
Affiliation(s)
- Bo Han
- School of Mathematical Sciences, Dalian University of Technology, Liaoning, China
| | - Ingrid Van Keilegom
- Research Center for Operations Research and Statistics, KU Leuven, Leuven, Belgium
| | - Xiaoguang Wang
- School of Mathematical Sciences, Dalian University of Technology, Liaoning, China
| |
Collapse
|
3
|
Abstract
The hazard function plays a central role in survival analysis. In a homogeneous population, the distribution of the time to event, described by the hazard, is the same for each individual. Heterogeneity in the distributions can be accounted for by including covariates in a model for the hazard, for instance a proportional hazards model. In this model, individuals with the same value of the covariates will have the same distribution. It is natural to think that not all covariates that are thought to influence the distribution of the survival outcome are included in the model. This implies that there is unobserved heterogeneity; individuals with the same value of the covariates may have different distributions. One way of accounting for this unobserved heterogeneity is to include random effects in the model. In the context of hazard models for time to event outcomes, such random effects are called frailties, and the resulting models are called frailty models. In this tutorial, we study frailty models for survival outcomes. We illustrate how frailties induce selection of healthier individuals among survivors, and show how shared frailties can be used to model positively dependent survival outcomes in clustered data. The Laplace transform of the frailty distribution plays a central role in relating the hazards, conditional on the frailty, to hazards and survival functions observed in a population. Available software, mainly in R, will be discussed, and the use of frailty models is illustrated in two different applications, one on center effects and the other on recurrent events.
Collapse
Affiliation(s)
- Theodor A Balan
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, The Netherlands
| | - Hein Putter
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, The Netherlands
| |
Collapse
|
4
|
Alotaibi RM, Rezk HR, Guure C. Bayesian frailty modeling of correlated survival data with application to under-five mortality. BMC Public Health 2020; 20:1429. [PMID: 32957954 PMCID: PMC7504601 DOI: 10.1186/s12889-020-09328-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2019] [Accepted: 08/03/2020] [Indexed: 11/24/2022] Open
Abstract
Background There is high rate of under-five mortality in West Africa with little effort made to study determinants that significantly increase or decrease its risk across the West African sub-region. This is important since it will help in the design of effective intervention programs for each country or the entire region. The overall objective of this research evaluates the determinants of under-five mortality prior to the end of the 2015 Millennium Development Goals, to guide West African countries implement strategies that will aid them achieve the Sustainable Development Goal 3 by 2030. Method This study used the Demographic and Health Survey (DHS) data from twelve (12) out of the eighteen West African countries; Ghana, Benin, Cote d’ Ivoire, Guinea, Liberia, Mali, Niger, Nigeria, Sierra Leone, Burkina Faso, Gambia and Togo. Data were extracted from the children and women of reproductive age files as provided in the DHS report. The response or outcome variable of interest is under-five mortality rate. A Bayesian exponential, Weibull and Gompertz regression models via a gamma shared frailty model were used for the analysis. The deviance information criteria and Bayes factors were used to discriminate between models. These analyses were carried out using Stata version 15 software. Results The study recorded 101 (95% CI: 98.6–103.5) deaths per 1000 live births occurring among the twelve countries. Burkina Faso (124.4), Cote D’lvoire (110.1), Guinea (116.4), Nigeria (120.6) and Niger (118.3) recorded the highest child under-5 mortality rate. Gambia (48.1), Ghana (60.1) and Benin (70.4) recorded the least unde-5 mortality rate per 1000 livebirths. Multiple birth children were about two times more likely to die compared to singleton birth, in all except Gambia, Nigeria and Sierra Leone. We observed significantly higher hazard rates for male compared to female children in the combined data analysis (HR: 1.14, 95% CI: [1.10–1.18]). The country specific analysis in Benin, Cote D’lvoire, Guinea, Liberia, Mali and Nigeria showed higher under-5 mortality hazard rates among male children compared to female children whilst Niger was the only country to report significantly lower hazard rate of males compared to females. Conclusion There is still quite a substantial amount of work to be done in order to meet the Sustainable Development Goal 3 in 2030 in West Africa. There exist variant differences among some of the countries with respect to mortality rates and determinants which require different interventions and policy decisions.
Collapse
Affiliation(s)
- Refah M Alotaibi
- Mathematical Sciences Department, College of Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia
| | - Hoda Ragab Rezk
- Mathematical Sciences Department, College of Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia.,Department of statistics, Al-Azhar University, Cairo, Egypt
| | - Chris Guure
- Department of Biostatistics, School of Public Health, University of Ghana, Accra, Ghana.
| |
Collapse
|
5
|
Gasperoni F, Ieva F, Paganoni AM, Jackson CH, Sharples L. Non-parametric frailty Cox models for hierarchical time-to-event data. Biostatistics 2020; 21:531-544. [PMID: 30590499 PMCID: PMC6451633 DOI: 10.1093/biostatistics/kxy071] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2018] [Revised: 10/15/2018] [Accepted: 10/16/2018] [Indexed: 11/14/2022] Open
Abstract
We propose a novel model for hierarchical time-to-event data, for example, healthcare data in which patients are grouped by their healthcare provider. The most common model for this kind of data is the Cox proportional hazard model, with frailties that are common to patients in the same group and given a parametric distribution. We relax the parametric frailty assumption in this class of models by using a non-parametric discrete distribution. This improves the flexibility of the model by allowing very general frailty distributions and enables the data to be clustered into groups of healthcare providers with a similar frailty. A tailored Expectation-Maximization algorithm is proposed for estimating the model parameters, methods of model selection are compared, and the code is assessed in simulation studies. This model is particularly useful for administrative data in which there are a limited number of covariates available to explain the heterogeneity associated with the risk of the event. We apply the model to a clinical administrative database recording times to hospital readmission, and related covariates, for patients previously admitted once to hospital for heart failure, and we explore latent clustering structures among healthcare providers.
Collapse
Affiliation(s)
- Francesca Gasperoni
- MOX - Modelling and Scientific Computing, Department of Mathematics Politecnico di Milano, Piazza Leonardo Da Vinci 32, Milano 20123, Italy
| | - Francesca Ieva
- MOX - Modelling and Scientific Computing, Department of Mathematics Politecnico di Milano, Piazza Leonardo Da Vinci 32, Milano 20123, Italy
| | - Anna Maria Paganoni
- MOX - Modelling and Scientific Computing, Department of Mathematics Politecnico di Milano, Piazza Leonardo Da Vinci 32, Milano 20123, Italy
| | - Christopher H Jackson
- MRC Biostatistics Unit, Cambridge Institute of Public Health, Forvie Site, Robinson Way, Cambridge Biomedical Campus, Cambridge CB2 0SR, UK
| | - Linda Sharples
- Department of Medical Statistics, London School of Hygiene & Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| |
Collapse
|
6
|
Florens JP, Simar L, Van Keilegom I. Estimation of the Boundary of a Variable Observed With Symmetric Error. J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2018.1555093] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
| | - Léopold Simar
- Toulouse School of Economics, Université Toulouse Capitole
- ISBA, Université Catholique de Louvain, Ottignies-Louvain-la-Neuve, Belgium
| | - Ingrid Van Keilegom
- ISBA, Université Catholique de Louvain, Ottignies-Louvain-la-Neuve, Belgium
- ORSTAT, KU Leuven, Leuven, Belgium
| |
Collapse
|
7
|
Wen CC, Chen YH. Pseudo and conditional score approach to joint analysis of current count and current status data. Biometrics 2018; 74:1223-1231. [PMID: 29665618 DOI: 10.1111/biom.12880] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Revised: 01/01/2018] [Accepted: 02/01/2018] [Indexed: 11/28/2022]
Abstract
We develop a joint analysis approach for recurrent and nonrecurrent event processes subject to case I interval censorship, which are also known in literature as current count and current status data, respectively. We use a shared frailty to link the recurrent and nonrecurrent event processes, while leaving the distribution of the frailty fully unspecified. Conditional on the frailty, the recurrent event is assumed to follow a nonhomogeneous Poisson process, and the mean function of the recurrent event and the survival function of the nonrecurrent event are assumed to follow some general form of semiparametric transformation models. Estimation of the models is based on the pseudo-likelihood and the conditional score techniques. The resulting estimators for the regression parameters and the unspecified baseline functions are shown to be consistent with rates of square and cubic roots of the sample size, respectively. Asymptotic normality with closed-form asymptotic variance is derived for the estimator of the regression parameters. We apply the proposed method to a fracture-osteoporosis survey data to identify risk factors jointly for fracture and osteoporosis in elders, while accounting for association between the two events within a subject.
Collapse
Affiliation(s)
- Chi-Chung Wen
- Department of Mathematics, Tamkang University, Taiwan
| | - Yi-Hau Chen
- Institute of Statistical Science, Academia Sinica, Taiwan
| |
Collapse
|
8
|
Geerdens C, Acar EF, Janssen P. Conditional copula models for right-censored clustered event time data. Biostatistics 2017; 19:247-262. [DOI: 10.1093/biostatistics/kxx034] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Accepted: 07/14/2017] [Indexed: 11/13/2022] Open
Affiliation(s)
- Candida Geerdens
- Center for Statistics, Hasselt University, Agoralaan Building D, B-3590 Diepenbeek, Belgium
| | - Elif Fidan Acar
- Department of Statistics, University of Manitoba, 186 Dysart Road, Winnipeg, Manitoba R3T 2N2, Canada
| | - Paul Janssen
- Center for Statistics, Hasselt University, Agoralaan Building D, B-3590 Diepenbeek, Belgium
| |
Collapse
|
9
|
Garcia TP, Ma Y, Marder K, Wang Y. ROBUST MIXED EFFECTS MODEL FOR CLUSTERED FAILURE TIME DATA: APPLICATION TO HUNTINGTON'S DISEASE EVENT MEASURES. Ann Appl Stat 2017; 11:1085-1116. [PMID: 29399240 PMCID: PMC5793916 DOI: 10.1214/17-aoas1038] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
An important goal in clinical and statistical research is properly modeling the distribution for clustered failure times which have a natural intraclass dependency and are subject to censoring. We handle these challenges with a novel approach that does not impose restrictive modeling or distributional assumptions. Using a logit transformation, we relate the distribution for clustered failure times to covariates and a random, subject-specific effect. The covariates are modeled with unknown functional forms, and the random effect may depend on the covariates and have an unknown and unspecified distribution. We introduce pseudovalues to handle censoring and splines for functional covariate effects, and frame the problem into fitting an additive logistic mixed effects model. Unlike existing approaches for fitting such models, we develop semiparametric techniques that estimate the functional model parameters without specifying or estimating the random effect distribution. We show both theoretically and empirically that the resulting estimators are consistent for any choice of random effect distribution and any dependency structure between the random effect and covariates. Last, we illustrate the method's utility in an application to a Huntington's disease study where our method provides new insights into differences between motor and cognitive impairment event times in at-risk subjects.
Collapse
|
10
|
Balan TA, Boonk SE, Vermeer MH, Putter H. Score test for association between recurrent events and a terminal event. Stat Med 2016; 35:3037-48. [PMID: 26891109 DOI: 10.1002/sim.6913] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2015] [Revised: 01/07/2016] [Accepted: 01/27/2016] [Indexed: 11/09/2022]
Abstract
The statistical analysis of recurrent events relies on the assumption of independent censoring. When random effects are used, this means, in addition, that the censoring cannot depend on the random effect. Whenever the recurrent event process is terminated by death, this assumption might not be satisfied. Because joint models arising from such situations are more difficult to fit and interpret, clinicians rarely check whether joint modeling is preferred. In this paper, we propose and compare simple, yet efficient methods for testing whether the terminal event and the recurrent events are associated or not. The performance of the proposed methods is evaluated in a simulation study, and the sensitivity to misspecification of the model is assessed. Finally, the methods are illustrated on a data set comprising repeated observations of skin tumors on T-cell lymphoma patients. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Theodor-Adrian Balan
- Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, The Netherlands
| | - Stephanie E Boonk
- Department of Dermatology, Leiden University Medical Center, Leiden, The Netherlands
| | - Maarten H Vermeer
- Department of Dermatology, Leiden University Medical Center, Leiden, The Netherlands
| | - Hein Putter
- Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
11
|
Emura T, Nakatochi M, Murotani K, Rondeau V. A joint frailty-copula model between tumour progression and death for meta-analysis. Stat Methods Med Res 2015; 26:2649-2666. [DOI: 10.1177/0962280215604510] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Dependent censoring often arises in biomedical studies when time to tumour progression (e.g., relapse of cancer) is censored by an informative terminal event (e.g., death). For meta-analysis combining existing studies, a joint survival model between tumour progression and death has been considered under semicompeting risks, which induces dependence through the study-specific frailty. Our paper here utilizes copulas to generalize the joint frailty model by introducing additional source of dependence arising from intra-subject association between tumour progression and death. The practical value of the new model is particularly evident for meta-analyses in which only a few covariates are consistently measured across studies and hence there exist residual dependence. The covariate effects are formulated through the Cox proportional hazards model, and the baseline hazards are nonparametrically modeled on a basis of splines. The estimator is then obtained by maximizing a penalized log-likelihood function. We also show that the present methodologies are easily modified for the competing risks or recurrent event data, and are generalized to accommodate left-truncation. Simulations are performed to examine the performance of the proposed estimator. The method is applied to a meta-analysis for assessing a recently suggested biomarker CXCL12 for survival in ovarian cancer patients. We implement our proposed methods in R joint.Cox package.
Collapse
Affiliation(s)
- Takeshi Emura
- Graduate Institute of Statistics, National Central University, Jhongli City, Taoyuan, Taiwan
| | - Masahiro Nakatochi
- Center for Advanced Medicine and Clinical Research, Nagoya University Hospital, Japan
| | - Kenta Murotani
- Center for Advanced Medicine and Clinical Research, Nagoya University Hospital, Japan
| | - Virginie Rondeau
- INSERM CR897 (Biostatistic), Université Bordeaux Segalen, Bordeaux Cedex, France
| |
Collapse
|