1
|
Sun T, Cheng Y, Ding Y. An information ratio-based goodness-of-fit test for copula models on censored data. Biometrics 2023; 79:1713-1725. [PMID: 36440608 PMCID: PMC10225017 DOI: 10.1111/biom.13807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Accepted: 11/10/2022] [Indexed: 11/29/2022]
Abstract
Copula is a popular method for modeling the dependence among marginal distributions in multivariate censored data. As many copula models are available, it is essential to check if the chosen copula model fits the data well for analysis. Existing approaches to testing the fitness of copula models are mainly for complete or right-censored data. No formal goodness-of-fit (GOF) test exists for interval-censored or recurrent events data. We develop a general GOF test for copula-based survival models using the information ratio (IR) to address this research gap. It can be applied to any copula family with a parametric form, such as the frequently used Archimedean, Gaussian, and D-vine families. The test statistic is easy to calculate, and the test procedure is straightforward to implement. We establish the asymptotic properties of the test statistic. The simulation results show that the proposed test controls the type-I error well and achieves adequate power when the dependence strength is moderate to high. Finally, we apply our method to test various copula models in analyzing multiple real datasets. Our method consistently separates different copula models for all these datasets in terms of model fitness.
Collapse
Affiliation(s)
- Tao Sun
- Center for Applied Statistics and School of Statistics, Renmin University of China, Beijing, China
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Yu Cheng
- Department of Statistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Ying Ding
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
2
|
Boukeloua M. Study of semiparametric copula models via divergences with bivariate censored data. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2020.1734834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
- Mohamed Boukeloua
- Laboratoire de Génie des Procédés pour le Développement Durable et les Produits de Santés (LGPDDPS), Ecole Nationale Polytechnique de Constantine, Constantine, Algeria
- Laboratoire LAMASD, Département de Mathématiques, Université Frères Mentouri, Constantine, Algeria
| |
Collapse
|
3
|
Shu D, He W. Perturbation‐based null hypothesis tests with an application to Clayton models. CAN J STAT 2021. [DOI: 10.1002/cjs.11612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Di Shu
- Department of Population Medicine Harvard Medical School and Harvard Pilgrim Health Care Institute Boston 02215 MA U.S.A
| | - Wenqing He
- Department of Statistical and Actuarial Sciences University of Western Ontario London ON Canada N6A 5B7
| |
Collapse
|
4
|
Parner ET, Andersen PK, Overgaard M. Cumulative risk regression in case-cohort studies using pseudo-observations. LIFETIME DATA ANALYSIS 2020; 26:639-658. [PMID: 31933047 DOI: 10.1007/s10985-020-09492-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Accepted: 01/03/2020] [Indexed: 06/10/2023]
Abstract
Case-cohort studies are useful when information on certain risk factors is difficult or costly to ascertain. Particularly, a case-cohort study may be well suited in situations where several case series are of interest, e.g. in studies with competing risks, because the same sub-cohort may serve as a comparison group for all case series. Previous analyses of this kind of sampled cohort data most often involved estimation of rate ratios based on a Cox regression model. However, with competing risks this method will not provide parameters that directly describe the association between covariates and cumulative risks. In this paper, we study regression analysis of cause-specific cumulative risks in case-cohort studies using pseudo-observations. We focus mainly on the situation with competing risks. However, as a by-product, we also develop a method by which absolute mortality risks may be analyzed directly from case-cohort survival data. We adjust for the case-cohort sampling by inverse sampling probabilities applied to a generalized estimation equation. The large-sample properties of the proposed estimator are developed and small-sample properties are evaluated in a simulation study. We apply the methodology to study the effect of a specific diet component and a specific gene on the absolute risk of atrial fibrillation.
Collapse
Affiliation(s)
- Erik T Parner
- Section for Biostatistics, Aarhus University, Bartholins Allé 2, 8000, Aarhus C, Denmark.
| | - Per K Andersen
- Section of Biostatistics, University of Copenhagen, Øster Farimagsgade 5, 1014, Copenhagen K, Denmark
| | - Morten Overgaard
- Section for Biostatistics, Aarhus University, Bartholins Allé 2, 8000, Aarhus C, Denmark
| |
Collapse
|
5
|
Huang Y, Zhang L, Lian G, Zhan R, Xu R, Huang Y, Mitra B, Wu J, Luo G. A novel mathematical model to predict prognosis of burnt patients based on logistic regression and support vector machine. Burns 2016; 42:291-9. [PMID: 26774603 DOI: 10.1016/j.burns.2015.08.009] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2015] [Revised: 07/09/2015] [Accepted: 08/07/2015] [Indexed: 11/25/2022]
Abstract
OBJECTIVE To develop a mathematical model of predicting mortality based on the admission characteristics of 6220 burn cases. METHODS Data on all the burn patients presenting to Institute of Burn Research, Southwest Hospital, Third Military Medical University from January of 1999 to December of 2008 were extracted from the departmental registry. The distributions of burn cases were scattered by principal component analysis. Univariate associations with mortality were identified and independent associations were derived from multivariate logistic regression analysis. Using variables independently and significantly associated with mortality, a mathematical model to predict mortality was developed using the support vector machine (SVM) model. The predicting ability of this model was evaluated and verified. RESULTS The overall mortality in this study was 1.8%. Univariate associations with mortality were identified and independent associations were derived from multivariate logistic regression analysis. Variables at admission independently associated with mortality were gender, age, total burn area, full thickness burn area, inhalation injury, shock, period before admission and others. The sensitivity and specificity of logistic model were 99.75% and 85.84% respectively, with an area under the receiver operating curve of 0.989 (95% CI: 0.979-1.000; p<0.01). The model correctly classified 99.50% of cases. The subsequently developed support vector machine (SVM) model correctly classified nearly 100% of test cases, which could not only predict adult group but also pediatric group, with pretty high robustness (92%-100%). CONCLUSION A mathematical model based on logistic regression and SVM could be used to predict the survival prognosis according to the admission characteristics.
Collapse
Affiliation(s)
- Yinghui Huang
- Institute of Burn Research, Southwest Hospital, Third Military Medical University, Chongqing, China; Institute of Combined Injury, State Key Laboratory of Trauma, Burns and Combined Injury, Chongqing Engineering Research Center for Nanomedicine, College of Preventive Medicine, Third Military Medical University, Chongqing, China; Department of Biochemistry and Molecular Biology, Third Military Medical University, Chongqing, China.
| | - Lei Zhang
- College of Communication Engineering, Chongqing University, Chongqing 400044, China.
| | - Guan Lian
- Institute of Burn Research, Southwest Hospital, Third Military Medical University, Chongqing, China.
| | - Rixing Zhan
- Institute of Burn Research, Southwest Hospital, Third Military Medical University, Chongqing, China.
| | - Rufu Xu
- The Department of Epidemiology, Third Military Medical University, Chongqing, China.
| | - Yan Huang
- Department of Biochemistry and Molecular Biology, Third Military Medical University, Chongqing, China.
| | - Biswadev Mitra
- Trauma Service Center, Alfred Hospital, 55 Commercial Road, Melbourne, VIC 3004, Australia.
| | - Jun Wu
- Institute of Burn Research, Southwest Hospital, Third Military Medical University, Chongqing, China.
| | - Gaoxing Luo
- Institute of Burn Research, Southwest Hospital, Third Military Medical University, Chongqing, China.
| |
Collapse
|
6
|
Ning J, Bandeen-Roche K. Estimation of time-dependent association for bivariate failure times in the presence of a competing risk. Biometrics 2013; 70:10-20. [PMID: 24350628 DOI: 10.1111/biom.12110] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2012] [Revised: 08/01/2013] [Accepted: 09/01/2013] [Indexed: 11/29/2022]
Abstract
This article targets the estimation of a time-dependent association measure for bivariate failure times, the conditional cause-specific hazards ratio (CCSHR), which is a generalization of the conditional hazards ratio (CHR) to accommodate competing risks data. We model the CCSHR as a parametric regression function of time and event causes and leave all other aspects of the joint distribution of the failure times unspecified. We develop a pseudo-likelihood estimation procedure for model fitting and inference and establish the asymptotic properties of the estimators. We assess the finite-sample properties of the proposed estimators against the estimators obtained from a moment-based estimating equation approach. Data from the Cache County study on dementia are used to illustrate the proposed methodology.
Collapse
Affiliation(s)
- Jing Ning
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, U.S.A
| | | |
Collapse
|
7
|
Geerdens C, Claeskens G, Janssen P. Goodness-of-fit tests for the frailty distribution in proportional hazards models with shared frailty. Biostatistics 2012; 14:433-46. [DOI: 10.1093/biostatistics/kxs053] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
8
|
Yilmaz YE, Lawless JF. Likelihood ratio procedures and tests of fit in parametric and semiparametric copula models with censored data. LIFETIME DATA ANALYSIS 2011; 17:386-408. [PMID: 21279545 DOI: 10.1007/s10985-011-9192-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2010] [Accepted: 01/17/2011] [Indexed: 05/30/2023]
Abstract
Copula models for multivariate lifetimes have become widely used in areas such as biomedicine, finance and insurance. This paper fills some gaps in existing methodology for copula parameters and model assessment. We consider procedures based on likelihood and pseudolikelihood ratio statistics and introduce semiparametric maximum likelihood estimation leading to semiparametric versions. For cases where standard asymptotic approximations do not hold, we propose an efficient simulation technique for obtaining p-values. We apply these methods to tests for a copula model, based on embedding it in a larger copula family. It is shown that the likelihood and pseudolikelihood ratio tests are consistent even when the expanded copula model is misspecified. Power comparisons with two other tests of fit indicate that model expansion provides a convenient, powerful and robust approach. The methods are illustrated on an application concerning the time to loss of vision in the two eyes of an individual.
Collapse
Affiliation(s)
- Yildiz E Yilmaz
- Prosserman Centre for Health Research, Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.
| | | |
Collapse
|
9
|
Petersen L, Sørensen TIA, Andersen PK. A shared frailty model for case-cohort samples: Parent and offspring relations in an adoption study. Stat Med 2010; 29:924-31. [DOI: 10.1002/sim.3729] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
10
|
Massonnet G, Janssen P, Duchateau L. Modelling udder infection data using copula models for quadruples. J Stat Plan Inference 2009. [DOI: 10.1016/j.jspi.2009.05.025] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
11
|
Cook RJ, Tolusso D. Second-order estimating equations for the analysis of clustered current status data. Biostatistics 2009; 10:756-72. [DOI: 10.1093/biostatistics/kxp029] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
12
|
Beaudoin D, Lakhal-Chaieb L. Archimedean copula model selection under dependent truncation. Stat Med 2008; 27:4440-54. [DOI: 10.1002/sim.3316] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
13
|
Huang X, Zhang N. Regression survival analysis with an assumed copula for dependent censoring: a sensitivity analysis approach. Biometrics 2008; 64:1090-9. [PMID: 18266895 DOI: 10.1111/j.1541-0420.2008.00986.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
SUMMARY In clinical studies, when censoring is caused by competing risks or patient withdrawal, there is always a concern about the validity of treatment effect estimates that are obtained under the assumption of independent censoring. Because dependent censoring is nonidentifiable without additional information, the best we can do is a sensitivity analysis to assess the changes of parameter estimates under different assumptions about the association between failure and censoring. This analysis is especially useful when knowledge about such association is available through literature review or expert opinions. In a regression analysis setting, the consequences of falsely assuming independent censoring on parameter estimates are not clear. Neither the direction nor the magnitude of the potential bias can be easily predicted. We provide an approach to do sensitivity analysis for the widely used Cox proportional hazards models. The joint distribution of the failure and censoring times is assumed to be a function of their marginal distributions. This function is called a copula. Under this assumption, we propose an iteration algorithm to estimate the regression parameters and marginal survival functions. Simulation studies show that this algorithm works well. We apply the proposed sensitivity analysis approach to the data from an AIDS clinical trial in which 27% of the patients withdrew due to toxicity or at the request of the patient or investigator.
Collapse
Affiliation(s)
- Xuelin Huang
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA.
| | | |
Collapse
|