1
|
Ghosal R, Matabuena M, Zhang J. Functional proportional hazards mixture cure model with applications in cancer mortality in NHANES and post ICU recovery. Stat Methods Med Res 2023; 32:2254-2269. [PMID: 37855203 DOI: 10.1177/09622802231206472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2023]
Abstract
We develop a functional proportional hazards mixture cure model with scalar and functional covariates measured at the baseline. The mixture cure model, useful in studying populations with a cure fraction of a particular event of interest is extended to functional data. We employ the expectation-maximization algorithm and develop a semiparametric penalized spline-based approach to estimate the dynamic functional coefficients of the incidence and the latency part. The proposed method is computationally efficient and simultaneously incorporates smoothness in the estimated functional coefficients via roughness penalty. Simulation studies illustrate a satisfactory performance of the proposed method in accurately estimating the model parameters and the baseline survival function. Finally, the clinical potential of the model is demonstrated in two real data examples that incorporate rich high-dimensional biomedical signals as functional covariates measured at the baseline and constitute novel domains to apply cure survival models in contemporary medical situations. In particular, we analyze (i) minute-by-minute physical activity data from the National Health And Nutrition Examination Survey 2003-2006 to study the association between diurnal patterns of physical activity at baseline and all cancer mortality through 2019 while adjusting for other biological factors; (ii) the impact of daily functional measures of disease severity collected in the intensive care unit on post intensive care unit recovery and mortality event. Our findings provide novel epidemiological insights into the association between daily patterns of physical activity and cancer mortality. Software implementation and illustration of the proposed estimation method are provided in R.
Collapse
Affiliation(s)
- Rahul Ghosal
- Department of Epidemiology and Biostatistics, University of South Carolina, Columbia, SC, USA
| | - Marcos Matabuena
- Department of Biostatistics, Harvard University T. H. Chan School of Public Health, Boston, MA, USA
| | - Jiajia Zhang
- Department of Epidemiology and Biostatistics, University of South Carolina, Columbia, SC, USA
| |
Collapse
|
2
|
Wang X, Claggett BL, Tian L, Malachias MVB, Pfeffer MA, Wei LJ. Quantifying and Interpreting the Prediction Accuracy of Models for the Time of a Cardiovascular Event-Moving Beyond C Statistic: A Review. JAMA Cardiol 2023; 8:290-295. [PMID: 36723915 PMCID: PMC10660575 DOI: 10.1001/jamacardio.2022.5279] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Importance For personalized or stratified medicine, it is critical to establish a reliable and efficient prediction model for a clinical outcome of interest. The goal is to develop a parsimonious model with fewer predictors for broad future application without compromising predictability. A general approach is to construct various empirical models via individual patients' specific baseline characteristics/biomarkers and then evaluate their relative merits. When the outcome of interest is the timing of a cardiovascular event, a commonly used metric to assess the adequacy of the fitted models is based on C statistics. These measures quantify a model's ability to separate those who develop events earlier from those who develop them later or not at all (discrimination), but they do not measure how closely model estimates match observed outcomes (prediction accuracy). Metrics that provide clinically interpretable measures to quantify prediction accuracy are needed. Observations C statistics measure the concordance between the risk scores derived from the model and the observed event time observations. However, C statistics do not quantify the model prediction accuracy. The integrated Brier Score, which calculates the mean squared distance between the empirical cumulative event-free curve and its individual patient's counterparts, estimates the prediction accuracy, but it is not clinically intuitive. A simple alternative measure is the average distance between the observed and predicted event times over the entire study population. This metric directly quantifies the model prediction accuracy and has often been used to evaluate the goodness of fit of the assumed models in settings other than survival data. This time-scale measure is easier to interpret than the C statistics or the Brier score. Conclusions and Relevance This article enhances our understanding of the model selection/evaluation process with respect to prediction accuracy. A simple, intuitive measure for quantifying such accuracy beyond C statistics can improve the reliability and efficiency of the selected model for personalized and stratified medicine.
Collapse
Affiliation(s)
- Xuan Wang
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts
| | - Brian Lee Claggett
- Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Harvard University, Boston, Massachusetts
| | - Lu Tian
- Department of Biomedical Data Science, Stanford University, Stanford, California
| | | | - Marc A Pfeffer
- Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Harvard University, Boston, Massachusetts
| | - Lee-Jen Wei
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts
| |
Collapse
|
3
|
Wang S, Xu D, Wang C, Sun J. Estimation of linear transformation cure models with informatively interval-censored failure time data. J Nonparametr Stat 2022. [DOI: 10.1080/10485252.2022.2148667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Shuying Wang
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, People's Republic of Chin
| | - Da Xu
- Key Laboratory of Applied Statistics of MOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, People's Republic of China
| | - Chunjie Wang
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, People's Republic of Chin
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, MO, USA
| |
Collapse
|
4
|
Jin J, Song X, Sun L. Dynamic semiparametric transformation models for recurrent event data with a terminal event. Stat Med 2022; 41:5432-5447. [PMID: 36121319 DOI: 10.1002/sim.9577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 08/01/2022] [Accepted: 09/05/2022] [Indexed: 11/06/2022]
Abstract
Recurrent event data with a terminal event commonly arise in many longitudinal follow-up studies. This article proposes a class of dynamic semiparametric transformation models for the marginal mean functions of the recurrent events with a terminal event, where some covariate effects may be time-varying. An estimation procedure is developed for the model parameters, and the asymptotic properties of the resulting estimators are established. In addition, relevant significance tests are suggested for examining whether or not covariate effects vary with time, and a model checking procedure is presented for assessing the adequacy of the proposed models. The finite sample performance of the proposed estimators is examined through simulation studies, and an application to a medical cost study of chronic heart failure patients is provided.
Collapse
Affiliation(s)
- Jin Jin
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing, China
| | - Xinyuan Song
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, China
| | - Liuquan Sun
- Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
5
|
He B, Ma S, Zhang X, Zhu LX. Rank-Based Greedy Model Averaging for High-Dimensional Survival Data. J Am Stat Assoc 2022; 118:2658-2670. [PMID: 39552724 PMCID: PMC11566305 DOI: 10.1080/01621459.2022.2070070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 04/08/2022] [Indexed: 10/18/2022]
Abstract
Model averaging is an effective way to enhance prediction accuracy. However, most previous works focus on low-dimensional settings with completely observed responses. To attain an accurate prediction for the risk effect of survival data with high-dimensional predictors, we propose a novel method: rank-based greedy (RG) model averaging. Specifically, adopting the transformation model with splitting predictors as working models, we doubly use the smooth concordance index function to derive the candidate predictions and optimal model weights. The final prediction is achieved by weighted averaging all the candidates. Our approach is flexible, computationally efficient, and robust against model misspecification, as it neither requires the correctness of a joint model nor involves the estimation of the transformation function. We further adopt the greedy algorithm for high dimensions. Theoretically, we derive an asymptotic error bound for the optimal weights under some mild conditions. In addition, the summation of weights assigned to the correct candidate submodels is proven to approach one in probability when there are correct models included among the candidate submodels. Extensive numerical studies are carried out using both simulated and real datasets to show the proposed approach's robust performance compared to the existing regularization approaches. Supplementary materials for this article are available online.
Collapse
Affiliation(s)
- Baihua He
- International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, China
| | - Shuangge Ma
- Department of Biostatistics, Yale University, New Haven, CT
| | - Xinyu Zhang
- International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, China
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Li-Xing Zhu
- Center for Statistics and Data Science, Beijing Normal University at Zhuhai, Zhuhai, China
- Department of Mathematics, Hong Kong Baptist University, Kowloon Tong, Hong Kong
| |
Collapse
|
6
|
Liang L, Hou J, Uno H, Cho K, Ma Y, Cai T. Semi-supervised approach to event time annotation using longitudinal electronic health records. LIFETIME DATA ANALYSIS 2022; 28:428-491. [PMID: 35753014 PMCID: PMC10044535 DOI: 10.1007/s10985-022-09557-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 05/13/2022] [Indexed: 06/15/2023]
Abstract
Large clinical datasets derived from insurance claims and electronic health record (EHR) systems are valuable sources for precision medicine research. These datasets can be used to develop models for personalized prediction of risk or treatment response. Efficiently deriving prediction models using real world data, however, faces practical and methodological challenges. Precise information on important clinical outcomes such as time to cancer progression are not readily available in these databases. The true clinical event times typically cannot be approximated well based on simple extracts of billing or procedure codes. Whereas, annotating event times manually is time and resource prohibitive. In this paper, we propose a two-step semi-supervised multi-modal automated time annotation (MATA) method leveraging multi-dimensional longitudinal EHR encounter records. In step I, we employ a functional principal component analysis approach to estimate the underlying intensity functions based on observed point processes from the unlabeled patients. In step II, we fit a penalized proportional odds model to the event time outcomes with features derived in step I in the labeled data where the non-parametric baseline function is approximated using B-splines. Under regularity conditions, the resulting estimator of the feature effect vector is shown as root-n consistent. We demonstrate the superiority of our approach relative to existing approaches through simulations and a real data example on annotating lung cancer recurrence in an EHR cohort of lung cancer patients from Veteran Health Administration.
Collapse
Affiliation(s)
- Liang Liang
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Jue Hou
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Hajime Uno
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Kelly Cho
- Massachusetts Veterans Epidemiology Research and Information Center, US Department of Veteran Affairs, Boston, MA, USA
- Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Yanyuan Ma
- Department of Statistics, Penn State University, University Park, PA, Boston, USA
| | - Tianxi Cai
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
7
|
O'Quigley J. Testing for Differences in Survival When Treatment Effects Are Persistent, Decaying, or Delayed. J Clin Oncol 2022; 40:3537-3545. [PMID: 35767775 DOI: 10.1200/jco.21.01811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A statistical test for the presence of treatment effects on survival will be based on a null hypothesis (absence of effects) and an alternative (presence of effects). The null is very simply expressed. The most common alternative, also simply expressed, is that of proportional hazards. For this situation, not only do we have a very powerful test in the log-rank test but also the outcome is readily interpreted. However, many modern treatments fall outside this relatively straightforward paradigm and, as such, have attracted attention from statisticians eager to do their best to avoid losing power as well as to maintain interpretability when the alternative hypothesis is less simple. Examples include trials where the treatment effect decays with time, immunotherapy trials where treatment effects may be slow to manifest themselves as well as the so-called crossing hazards problem. We review some of the solutions that have been proposed to deal with these issues. We pay particular attention to the integrated log-rank test and how it can be combined with the log-rank test itself to obtain powerful tests for these more complex situations.
Collapse
Affiliation(s)
- John O'Quigley
- Department of Statistical Science, University College London, London, United Kingdom
| |
Collapse
|
8
|
Wang C, Jiang J, Song X. Bayesian transformation models with partly interval-censored data. Stat Med 2021; 41:1263-1279. [PMID: 34845732 DOI: 10.1002/sim.9271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 10/27/2021] [Accepted: 11/04/2021] [Indexed: 11/07/2022]
Abstract
In many scientific fields, partly interval-censored data, which consist of exactly observed and interval-censored observations on the failure time of interest, appear frequently. However, methodological developments in the analysis of partly interval-censored data are relatively limited and have mainly focused on additive or proportional hazards models. The general linear transformation model provides a highly flexible modeling framework that includes several familiar survival models as special cases. Despite such nice features, the inference procedure for this class of models has not been developed for partly interval-censored data. We propose a fully Bayesian approach coped with efficient Markov chain Monte Carlo methods to fill this gap. A four-stage data augmentation procedure is introduced to tackle the challenges presented by the complex model and data structure. The proposed method is easy to implement and computationally attractive. The empirical performance of the proposed method is evaluated through two simulation studies, and the model is then applied to a dental health study.
Collapse
Affiliation(s)
- Chunjie Wang
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, China
| | - Jingjing Jiang
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, China
| | - Xinyuan Song
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
9
|
Zhu Y, Li L, Huang X. Landmark Linear Transformation Model for Dynamic Prediction with Application to a Longitudinal Cohort Study of Chronic Disease. J R Stat Soc Ser C Appl Stat 2019; 68:771-791. [PMID: 31467454 PMCID: PMC6715145 DOI: 10.1111/rssc.12334] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Dynamic prediction of the risk of a clinical event using longitudinally measured biomarkers or other prognostic information is important in clinical practice. We propose a new class of landmark survival models. The model takes the form of a linear transformation model, but allows all the model parameters to vary with the landmark time. This model includes many published landmark prediction models as special cases. We propose a unified local linear estimation framework to estimate time-varying model parameters. Simulation studies are conducted to evaluate the finite sample performance of the proposed method. We apply the methodology to a dataset from the African American Study of Kidney Disease and Hypertension and predict individual patient's risk of an adverse clinical event.
Collapse
Affiliation(s)
- Yayuan Zhu
- University of Western Ontario, London, Canada
| | - Liang Li
- University of Texas MD Anderson Cancer Center, Houston, USA
| | - Xuelin Huang
- University of Texas MD Anderson Cancer Center, Houston, USA
| |
Collapse
|
10
|
Large-margin learning of Cox proportional hazard models for survival analysis. APPL INTELL 2018. [DOI: 10.1007/s10489-018-1363-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
11
|
Ouyang M, Wang X, Wang C, Song X. Bayesian semiparametric failure time models for multivariate censored data with latent variables. Stat Med 2018; 37:4279-4297. [DOI: 10.1002/sim.7916] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Revised: 04/13/2018] [Accepted: 06/27/2018] [Indexed: 11/11/2022]
Affiliation(s)
- Ming Ouyang
- Shenzhen Reseach Institute and Department of Statistics; The Chinese University of Hong Kong; Hong Kong
| | - Xiaoqing Wang
- Shenzhen Reseach Institute and Department of Statistics; The Chinese University of Hong Kong; Hong Kong
| | - Chunjie Wang
- School of Mathematics and Statistics; Changchun University of Technology; Changchun China
| | - Xinyuan Song
- Shenzhen Reseach Institute and Department of Statistics; The Chinese University of Hong Kong; Hong Kong
| |
Collapse
|
12
|
Wang H, Dai H, Restaino M, Bao Y. Linear transformation models for censored data under truncation. J Stat Plan Inference 2018. [DOI: 10.1016/j.jspi.2017.07.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
13
|
Kwak M. Estimation and inference on the joint conditional distribution for bivariate longitudinal data using Gaussian copula. J Korean Stat Soc 2017. [DOI: 10.1016/j.jkss.2016.11.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
14
|
Affiliation(s)
- Torsten Hothorn
- Institut für Epidemiologie, Biostatistik und Prävention Universität Zürich
| | - Lisa Möst
- Institut für Statistik Ludwig‐Maximilians‐Universität München
| | | |
Collapse
|
15
|
Zheng Y, Brown M, Lok A, Cai T. IMPROVING EFFICIENCY IN BIOMARKER INCREMENTAL VALUE EVALUATION UNDER TWO-PHASE DESIGNS. Ann Appl Stat 2017; 11:638-654. [PMID: 28943991 PMCID: PMC5604898 DOI: 10.1214/16-aoas997] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Cost-effective yet efficient designs are critical to the success of biomarker evaluation research. Two-phase sampling designs, under which expensive markers are only measured on a subsample of cases and non-cases within a prospective cohort, are useful in novel biomarker studies for preserving study samples and minimizing cost of biomarker assaying. Statistical methods for quantifying the predictiveness of biomarkers under two-phase studies have been proposed (Cai and Zheng, 2012; Liu, Cai and Zheng, 2012). These methods are based on a class of inverse probability weighted (IPW) estimators where weights are 'true' sampling weights that simply reflect the sampling strategy of the study. While simple to implement, existing IPW estimators are limited by lack of practicality and efficiency. In this manuscript, we investigate a variety of two-phase design options and provide statistical approaches aimed at improving the efficiency of simple IPW estimators by incorporating auxiliary information available for the entire cohort. We consider accuracy summary estimators that accommodate auxiliary information in the context of evaluating the incremental values of novel biomarkers over existing prediction tools. In addition, we evaluate the relative efficiency of a variety of sampling and estimation options under two-phase studies, shedding light on issues pertaining to both the design and analysis of biomarker validation studies. We apply our methods to the evaluation of a novel biomarker for liver cancer risk conducted with a two-phase nested case control design (Lok et al., 2010).
Collapse
Affiliation(s)
- Yingye Zheng
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | - Marshall Brown
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | - Anna Lok
- Division of Gastroenterology, University of Michigan Ann Arbor, MI 48109
| | - Tianxi Cai
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115
| |
Collapse
|
16
|
Kwak M. Estimation and inference of the joint conditional distribution for multivariate longitudinal data using nonparametric copulas. J Nonparametr Stat 2017. [DOI: 10.1080/10485252.2017.1324966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Minjung Kwak
- Department of Statistics, Yeungnam University, South Korea
| |
Collapse
|
17
|
Li S. Estimating time-dependent ROC curves using data under prevalent sampling. Stat Med 2017; 36:1285-1301. [PMID: 27891650 DOI: 10.1002/sim.7184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2016] [Revised: 10/09/2016] [Accepted: 11/03/2016] [Indexed: 11/10/2022]
Abstract
Prevalent sampling is frequently a convenient and economical sampling technique for the collection of time-to-event data and thus is commonly used in studies of the natural history of a disease. However, it is biased by design because it tends to recruit individuals with longer survival times. This paper considers estimation of time-dependent receiver operating characteristic curves when data are collected under prevalent sampling. To correct the sampling bias, we develop both nonparametric and semiparametric estimators using extended risk sets and the inverse probability weighting techniques. The proposed estimators are consistent and converge to Gaussian processes, while substantial bias may arise if standard estimators for right-censored data are used. To illustrate our method, we analyze data from an ovarian cancer study and estimate receiver operating characteristic curves that assess the accuracy of the composite markers in distinguishing subjects who died within 3-5 years from subjects who remained alive. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Shanshan Li
- Department of Biostatistics, Indiana University Fairbanks School of Public Health, Indianapolis, 46202, IN, U.S.A
| |
Collapse
|
18
|
Deng S, Liu KY, Zhao X. Semiparametric regression analysis of multivariate longitudinal data with informative observation times. Comput Stat Data Anal 2017. [DOI: 10.1016/j.csda.2016.10.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
19
|
Liu D, Cai T, Lok A, Zheng Y. Nonparametric Maximum Likelihood Estimators of Time-Dependent Accuracy Measures for Survival Outcome Under Two-Stage Sampling Designs. J Am Stat Assoc 2017; 113:882-892. [PMID: 30555194 PMCID: PMC6291304 DOI: 10.1080/01621459.2017.1295866] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Revised: 12/01/2016] [Indexed: 12/24/2022]
Abstract
Large prospective cohort studies of rare chronic diseases require thoughtful planning of study designs, especially for biomarker studies when measurements are based on stored tissue or blood specimens. Two-phase designs, including nested case-control (Thomas, 1977) and case-cohort (Prentice, 1986) sampling designs, provide cost-effective strategies for conducting biomarker evaluation studies. Existing literature for biomarker assessment under two-phase designs largely focuses on simple inverse probability weighting (IPW) estimators (Cai and Zheng, 2011; Liu et al., 2012). Drawing on recent theoretical development on the maximum likelihood estimators for relative risk parameters in two-phase studies (Scheike and Martinussen, 2004; Zeng et al., 2006), we propose nonparametric maximum likelihood based estimators to evaluate the accuracy and predictiveness of a risk prediction biomarker under both types of two-phase designs. In addition, hybrid estimators that combine IPW estimators and maximum likelihood estimation procedure are proposed to improve efficiency and alleviate computational burden. We derive large sample properties of proposed estimators and evaluate their finite sample performance using numerical studies. We illustrate new procedures using a two-phase biomarker study aiming to evaluate the accuracy of a novel biomarker, des-γ-carboxy prothrombin, for early detection of hepatocellular carcinoma (Lok et al., 2010).
Collapse
Affiliation(s)
- Dandan Liu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37232
| | - Tianxi Cai
- Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts 02115
| | - Anna Lok
- Division of Gastroenterology, University of Michigan, Ann Arbor, MI 48109
| | - Yingye Zheng
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| |
Collapse
|
20
|
Song XY, Pan D, Liu PF, Cai JH. Bayesian analysis of transformation latent variable models with multivariate censored data. Stat Methods Med Res 2016; 25:2337-2358. [DOI: 10.1177/0962280214522786] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Transformation latent variable models are proposed in this study to analyze multivariate censored data. The proposed models generalize conventional linear transformation models to semiparametric transformation models that accommodate latent variables. The characteristics of the latent variables were assessed based on several correlated observed indicators through measurement equations. A Bayesian approach was developed with Bayesian P-splines technique and the Markov chain Monte Carlo algorithm to estimate the unknown parameters and transformation functions. Simulation shows that the performance of the proposed methodology is satisfactory. The proposed method was applied to analyze a cardiovascular disease data set.
Collapse
Affiliation(s)
- Xin-Yuan Song
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong
| | - Deng Pan
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong
| | - Peng-Fei Liu
- School of Mathematics and Statistics, Jiangsu Normal University, Xuzhou, China
| | - Jing-Heng Cai
- Department of Statistics, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
21
|
Abstract
In survival analysis, the estimation of patient-specific survivor functions that are conditional on a set of patient characteristics is of special interest. In general, knowledge of the conditional survival probabilities of a patient at all relevant time points allows better assessment of the patient's risk than summary statistics, such as median survival time. Nevertheless, standard methods for analysing survival data seldom estimate the survivor function directly. Therefore, we propose the application of conditional transformation models (CTMs) for the estimation of the conditional distribution function of survival times given a set of patient characteristics. We used the inverse probability of censoring weighting approach to account for right-censored observations. Our proposed modelling approach allows the prediction of patient-specific survivor functions. In addition, CTMs constitute a flexible model class that is able to deal with proportional as well as non-proportional hazards. The well-known Cox model is included in the class of CTMs as a special case. We investigated the performance of CTMs in survival data analysis in a simulation that included proportional and non-proportional hazard settings and different scenarios of explanatory variables. Furthermore, we re-analysed the survival times of patients suffering from chronic myelogenous leukaemia and studied the impact of the proportional hazards assumption on previously published results.
Collapse
|
22
|
Qiu Z, Zhou Y. Partially linear transformation models with varying coefficients for multivariate failure time data. J MULTIVARIATE ANAL 2015. [DOI: 10.1016/j.jmva.2015.08.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
23
|
Shen PS. On Fitting Transformation Model to Survey Data. COMMUN STAT-THEOR M 2015. [DOI: 10.1080/03610926.2013.857689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
24
|
Shen PS. Semiparametric transformation models with random effects for clustered doubly censored data. J STAT COMPUT SIM 2015. [DOI: 10.1080/00949655.2014.880706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
25
|
Cheng YJ, Wang MC. Causal estimation using semiparametric transformation models under prevalent sampling. Biometrics 2015; 71:302-12. [PMID: 25715045 DOI: 10.1111/biom.12286] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Revised: 12/01/2014] [Accepted: 12/01/2014] [Indexed: 11/29/2022]
Abstract
This article presents methods and inference for causal estimation in semiparametric transformation models for the prevalent survival data. Through the estimation of the transformation models and covariate distribution, we propose a few analytical procedures to estimate the causal survival function. As the data are observational, the unobserved potential outcome (survival time) may be associated with the treatment assignment, and therefore there may exist a systematic imbalance between the data observed from each treatment arm. Further, due to prevalent sampling, subjects are observed only if they have not experienced the failure event when data collection began, causing the prevalent sampling bias. We propose a unified approach, which simultaneously corrects the bias from the prevalent sampling and balances the systematic differences from the observational data. We illustrate in the simulation study that standard analysis without proper adjustment would result in biased causal inference. Large sample properties of the proposed estimation procedures are established by techniques of empirical processes and examined by simulation studies. The proposed methods are applied to the Surveillance, Epidemiology, and End Results (SEER) and Medicare-linked data for women diagnosed with breast cancer.
Collapse
Affiliation(s)
- Yu-Jen Cheng
- Institute of Statistics, National Tsing Hua University, Hsin-Chu 300, Taiwan
| | - Mei-Cheng Wang
- Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland 21205, U.S.A
| |
Collapse
|
26
|
Geng Y, Lu W, Zhang HH. A Model-Free Machine Learning Method for Risk Classification and Survival Probability Prediction. Stat (Int Stat Inst) 2014; 3:337-350. [PMID: 25530636 DOI: 10.1002/sta4.67] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Risk classification and survival probability prediction are two major goals in survival data analysis since they play an important role in patients' risk stratification, long-term diagnosis, and treatment selection. In this article, we propose a new model-free machine learning framework for risk classification and survival probability prediction based on weighted support vector machines. The new procedure does not require any specific parametric or semiparametric model assumption on data, and is therefore capable of capturing nonlinear covariate effects. We use numerous simulation examples to demonstrate finite sample performance of the proposed method under various settings. Applications to a glioma tumor data and a breast cancer gene expression survival data are shown to illustrate the new methodology in real data analysis.
Collapse
Affiliation(s)
- Yuan Geng
- Boehringer Ingelheim International Trading Co., Ltd., Shanghai, 200040, P. R. China
| | - Wenbin Lu
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
| | - Hao Helen Zhang
- Department of Mathematics, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
27
|
Geng Y, Zhang HH, Lu W. On optimal treatment regimes selection for mean survival time. Stat Med 2014; 34:1169-84. [PMID: 25515005 DOI: 10.1002/sim.6397] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Revised: 09/18/2014] [Accepted: 12/01/2014] [Indexed: 11/11/2022]
Abstract
In clinical studies with time-to-event as a primary endpoint, one main interest is to find the best treatment strategy to maximize patients' mean survival time. Due to patient's heterogeneity in response to treatments, great efforts have been devoted to developing optimal treatment regimes by integrating individuals' clinical and genetic information. A main challenge arises in the selection of important variables that can help to build reliable and interpretable optimal treatment regimes as the dimension of predictors may be high. In this paper, we propose a robust loss-based estimation framework that can be easily coupled with shrinkage penalties for both estimation of optimal treatment regimes and variable selection. The asymptotic properties of the proposed estimators are studied. Moreover, a model-free estimator of restricted mean survival time under the derived optimal treatment regime is developed, and its asymptotic property is studied. Simulations are conducted to assess the empirical performance of the proposed method for parameter estimation, variable selection, and optimal treatment decision. An application to an AIDS clinical trial data set is given to illustrate the method.
Collapse
Affiliation(s)
- Yuan Geng
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, U.S.A
| | | | | |
Collapse
|
28
|
Qiu Z, Neeta M, Zhou Y. Weighted Estimator for the Linear Transformation Models with Multivariate Failure Time Data. COMMUN STAT-THEOR M 2014. [DOI: 10.1080/03610926.2013.844254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
29
|
|
30
|
Gu M, Wu Y, Huang B. Partial marginal likelihood estimation for general transformation models. J MULTIVARIATE ANAL 2014. [DOI: 10.1016/j.jmva.2013.08.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
31
|
Semiparametric regression analysis for clustered doubly-censored data. Comput Stat 2013. [DOI: 10.1007/s00180-013-0462-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
32
|
Zheng Y, Cai T, Pepe MS. Adopting nested case-control quota sampling designs for the evaluation of risk markers. LIFETIME DATA ANALYSIS 2013; 19:568-588. [PMID: 23807695 PMCID: PMC3903399 DOI: 10.1007/s10985-013-9270-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2012] [Accepted: 06/01/2013] [Indexed: 06/02/2023]
Abstract
Two-phase study methods, in which more detailed or more expensive exposure information is only collected on a sample of individuals with events and a small proportion of other individuals, are expected to play a critical role in biomarker validation research. One major limitation of standard two-phase designs is that they are most conveniently employed with study cohorts in which information on longitudinal follow-up and other potential matching variables is electronically recorded. However for many practical situations, at the sampling stage such information may not be readily available for every potential candidates. Study eligibility needs to be verified by reviewing information from medical charts one by one. In this manuscript, we study in depth a novel study design commonly undertaken in practice that involves sampling until quotas of eligible cases and controls are identified. We propose semiparametric methods to calculate risk distributions and a wide variety of prediction indices when outcomes are censored failure times and data are collected under the quota sampling design. Consistency and asymptotic normality of our estimators are established using empirical process theory. Simulation results indicate that the proposed procedures perform well in finite samples. Application is made to the evaluation of a new risk model for predicting the onset of cardiovascular disease.
Collapse
Affiliation(s)
- Yingye Zheng
- Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA, 98109, USA,
| | | | | |
Collapse
|
33
|
Wu CO, Tian X. Nonparametric Estimation of Conditional Distributions and Rank-Tracking Probabilities With Time-Varying Transformation Models in Longitudinal Studies. J Am Stat Assoc 2013. [DOI: 10.1080/01621459.2013.808949] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
34
|
|
35
|
Fan C, Fine JP. Linear Transformation Model With Parametric Covariate Transformations. J Am Stat Assoc 2013. [DOI: 10.1080/01621459.2013.770707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
36
|
Yang S. Semiparametric inference on the absolute risk reduction and the restricted mean survival difference. LIFETIME DATA ANALYSIS 2013; 19:219-241. [PMID: 23392737 DOI: 10.1007/s10985-013-9243-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2012] [Accepted: 01/11/2013] [Indexed: 06/01/2023]
Abstract
For time-to-event data, when the hazards are non-proportional, in addition to the hazard ratio, the absolute risk reduction and the restricted mean survival difference can be used to describe the time-dependent treatment effect. The absolute risk reduction measures the direct impact of the treatment on event rate or survival, and the restricted mean survival difference provides a way to evaluate the cumulative treatment effect. However, in the literature, available methods are limited for flexibly estimating these measures and making inference on them. In this article, point estimates, pointwise confidence intervals and simultaneous confidence bands of the absolute risk reduction and the restricted mean survival difference are established under a semiparametric model that can be used in a sufficiently wide range of applications. These methods are motivated by and illustrated for data from the Women's Health Initiative estrogen plus progestin clinical trial.
Collapse
Affiliation(s)
- Song Yang
- Office of Biostatistics Research, National Heart, Lung, and Blood Institute, 6701 Rockledge Dr. MSC 7913, Bethesda, MD 20892, USA.
| |
Collapse
|
37
|
Liu H, Qin J, Shen Y. Imputation for semiparametric transformation models with biased-sampling data. LIFETIME DATA ANALYSIS 2012; 18:470-503. [PMID: 22903245 PMCID: PMC3440536 DOI: 10.1007/s10985-012-9225-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2011] [Accepted: 08/01/2012] [Indexed: 06/01/2023]
Abstract
Widely recognized in many fields including economics, engineering, epidemiology, health sciences, technology and wildlife management, length-biased sampling generates biased and right-censored data but often provide the best information available for statistical inference. Different from traditional right-censored data, length-biased data have unique aspects resulting from their sampling procedures. We exploit these unique aspects and propose a general imputation-based estimation method for analyzing length-biased data under a class of flexible semiparametric transformation models. We present new computational algorithms that can jointly estimate the regression coefficients and the baseline function semiparametrically. The imputation-based method under the transformation model provides an unbiased estimator regardless whether the censoring is independent or not on the covariates. We establish large-sample properties using the empirical processes method. Simulation studies show that under small to moderate sample sizes, the proposed procedure has smaller mean square errors than two existing estimation procedures. Finally, we demonstrate the estimation procedure by a real data example.
Collapse
Affiliation(s)
- Hao Liu
- Division of Biostatistics, Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, Texas, 77030, USA
| | - Jing Qin
- Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, National Institute of Health, Bethesda, Maryland, 20892, USA
| | - Yu Shen
- Department of Biostatistics, The University of Texas M. D. Anderson Cancer Center, Houston, Texas, 77030, USA
| |
Collapse
|
38
|
Shen PS. Semiparametric mixed-effects models for clustered doubly censored data. J Appl Stat 2012. [DOI: 10.1080/02664763.2012.684874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
39
|
Ding AA, Tian S, Yu Y, Guo H. A Class of Discrete Transformation Survival Models With Application to Default Probability Prediction. J Am Stat Assoc 2012. [DOI: 10.1080/01621459.2012.682806] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
| | | | - Yan Yu
- b University of Cincinnati , Cincinnati , OH
| | - Hui Guo
- c University of Cincinnati , Cincinnati , OH , 45221
| |
Collapse
|
40
|
SONG XINYUAN, MU XIAOYUN, SUN LIUQUAN. Regression Analysis of Longitudinal Data with Time-Dependent Covariates and Informative Observation Times. Scand Stat Theory Appl 2012. [DOI: 10.1111/j.1467-9469.2011.00776.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
41
|
Chen YQ, Hu N, Cheng SC, Musoke P, Zhao LP. Estimating Regression Parameters in an Extended Proportional Odds Model. J Am Stat Assoc 2012; 107:318-330. [PMID: 22904583 DOI: 10.1080/01621459.2012.656021] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
The proportional odds model may serve as a useful alternative to the Cox proportional hazards model to study association between covariates and their survival functions in medical studies. In this article, we study an extended proportional odds model that incorporates the so-called "external" time-varying covariates. In the extended model, regression parameters have a direct interpretation of comparing survival functions, without specifying the baseline survival odds function. Semiparametric and maximum likelihood estimation procedures are proposed to estimate the extended model. Our methods are demonstrated by Monte-Carlo simulations, and applied to a landmark randomized clinical trial of a short course Nevirapine (NVP) for mother-to-child transmission (MTCT) of human immunodeficiency virus type-1 (HIV-1). Additional application includes analysis of the well-known Veterans Administration (VA) Lung Cancer Trial.
Collapse
Affiliation(s)
- Ying Qing Chen
- Full Member, Vaccine and Infectious Disease and, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | | | | | | | | |
Collapse
|
42
|
Yang S, Prentice RL. Estimation of the 2-sample hazard ratio function using a semiparametric model. Biostatistics 2010; 12:354-68. [PMID: 20860993 DOI: 10.1093/biostatistics/kxq061] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The hazard ratio provides a natural target for assessing a treatment effect with survival data, with the Cox proportional hazards model providing a widely used special case. In general, the hazard ratio is a function of time and provides a visual display of the temporal pattern of the treatment effect. A variety of nonproportional hazards models have been proposed in the literature. However, available methods for flexibly estimating a possibly time-dependent hazard ratio are limited. Here, we investigate a semiparametric model that allows a wide range of time-varying hazard ratio shapes. Point estimates as well as pointwise confidence intervals and simultaneous confidence bands of the hazard ratio function are established under this model. The average hazard ratio function is also studied to assess the cumulative treatment effect. We illustrate corresponding inference procedures using coronary heart disease data from the Women's Health Initiative estrogen plus progestin clinical trial.
Collapse
Affiliation(s)
- Song Yang
- Office of Biostatistics Research, National Heart, Lung, and Blood Institute, Bethesda, MD 20892, USA.
| | | |
Collapse
|
43
|
|
44
|
Abstract
In statistical analysis, when the value of a random variable is only known to be between two bounds, we say that this random variable is interval censored. This complicated censoring pattern is a common problem in research fields such as clinical trials or actuarial studies and raises challenges for statistical analysis. In this paper, we focus on regression analysis of case 2 interval-censored data. We first briefly review existing regression methods and an estimation approach under the class of linear transformation models developed by Zhang et al. We then propose a method for survival probability prediction via generalized estimating equations. We also consider a graphical model checking technique and a model selection tool. Some theoretical properties are established and the performance of our procedures is evaluated and illustrated by numerical studies including a real-life data analysis.
Collapse
Affiliation(s)
- Zhigang Zhang
- Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, USA
| |
Collapse
|
45
|
Hattori S. Checking the linear transformation model for clustered failure time observations. LIFETIME DATA ANALYSIS 2008; 14:253-266. [PMID: 18240021 DOI: 10.1007/s10985-008-9082-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2006] [Accepted: 01/07/2008] [Indexed: 05/25/2023]
Abstract
The linear transformation model is a semiparametric model which contains the Cox proportional hazards model and the proportional odds model as special cases. Cai et al. (Biometrika 87:867-878, 2000) have proposed an inference procedure for the linear transformation model with correlated censored observations. In this article, we develop formal and graphical model checking techniques for the linear transformation models based on cumulative sums of martingale-type residuals. The proposed method is illustrated with a clinical trial data.
Collapse
Affiliation(s)
- Satoshi Hattori
- The Biostatistics Center, Kurume University, 67 Asahi-Machi, Kurume City, Fukuoka 830-0011, Japan.
| |
Collapse
|
46
|
Generalized time-dependent conditional linear models under left truncation and right censoring. ANN I STAT MATH 2008. [DOI: 10.1007/s10463-008-0187-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
47
|
Kosorok MR, Song R. Inference under right censoring for transformation models with a change-point based on a covariate threshold. Ann Stat 2007. [DOI: 10.1214/009053606000001244] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
48
|
Scheike TH. A flexible semiparametric transformation model for survival data. LIFETIME DATA ANALYSIS 2006; 12:461-80. [PMID: 17031497 DOI: 10.1007/s10985-006-9021-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2006] [Accepted: 07/28/2006] [Indexed: 05/12/2023]
Abstract
I suggest an extension of the semiparametric transformation model that specifies a time-varying regression structure for the transformation, and thus allows time-varying structure in the data. Special cases include a stratified version of the usual semiparametric transformation model. The model can be thought of as specifying a first order Taylor expansion of a completely flexible baseline. Large sample properties are derived and estimators of the asymptotic variances of the regression coefficients are given. The method is illustrated by a worked example and a small simulation study. A goodness of fit procedure for testing if the regression effects lead to a satisfactory fit is also suggested.
Collapse
Affiliation(s)
- Thomas H Scheike
- Department of Biostatistics, University of Copenhagen, Øster Farimagsgade 5 B, 2099, Copenhagen K, Denmark.
| |
Collapse
|
49
|
Lu W. Marginal regression of multivariate event times based on linear transformation models. LIFETIME DATA ANALYSIS 2005; 11:389-404. [PMID: 16133886 DOI: 10.1007/s10985-005-2969-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2004] [Accepted: 03/02/2005] [Indexed: 05/04/2023]
Abstract
Multivariate event time data are common in medical studies and have received much attention recently. In such data, each study subject may potentially experience several types of events or recurrences of the same type of event, or event times may be clustered. Marginal distributions are specified for the multivariate event times in multiple events and clustered events data, and for the gap times in recurrent events data, using the semiparametric linear transformation models while leaving the dependence structures for related events unspecified. We propose several estimating equations for simultaneous estimation of the regression parameters and the transformation function. It is shown that the resulting regression estimators are asymptotically normal, with variance-covariance matrix that has a closed form and can be consistently estimated by the usual plug-in method. Simulation studies show that the proposed approach is appropriate for practical use. An application to the well-known bladder cancer tumor recurrences data is also given to illustrate the methodology.
Collapse
Affiliation(s)
- Wenbin Lu
- Department of Statistics, North Carolina State University, Raleigh, 27695, USA.
| |
Collapse
|
50
|
Kosorok MR, Lee BL, Fine JP. Robust inference for univariate proportional hazards frailty regression models. Ann Stat 2004. [DOI: 10.1214/009053604000000535] [Citation(s) in RCA: 96] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|