1
|
Bühler A, Cook RJ, Lawless JF. Estimands and Cumulative Incidence Function Regression in Clinical Trials: Some New Results on Interpretability and Robustness. Stat Med 2024; 43:5513-5533. [PMID: 39468894 PMCID: PMC11589047 DOI: 10.1002/sim.10236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 06/06/2024] [Accepted: 09/17/2024] [Indexed: 10/30/2024]
Abstract
Regression analyses based on transformations of cumulative incidence functions are often adopted when modeling and testing for treatment effects in clinical trial settings involving competing and semi-competing risks. Common frameworks include the Fine-Gray model and models based on direct binomial regression. Using large sample theory we derive the limiting values of treatment effect estimators based on such models when the data are generated according to multiplicative intensity-based models, and show that the estimand is sensitive to several process features. The rejection rates of hypothesis tests based on cumulative incidence function regression models are also examined for null hypotheses of different types, based on which a robustness property is established. In such settings supportive secondary analyses of treatment effects are essential to ensure a full understanding of the nature of treatment effects. An application to a palliative study of individuals with breast cancer metastatic to bone is provided for illustration.
Collapse
Affiliation(s)
- Alexandra Bühler
- Department of Statistics and Actuarial ScienceUniversity of WaterlooWaterlooOntarioCanada
| | - Richard J. Cook
- Department of Statistics and Actuarial ScienceUniversity of WaterlooWaterlooOntarioCanada
| | - Jerald F. Lawless
- Department of Statistics and Actuarial ScienceUniversity of WaterlooWaterlooOntarioCanada
| |
Collapse
|
2
|
Andersen PK, Wandall ENS, Pohar Perme M. Inference for transition probabilities in non-Markov multi-state models. LIFETIME DATA ANALYSIS 2022; 28:585-604. [PMID: 35764854 DOI: 10.1007/s10985-022-09560-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 06/07/2022] [Indexed: 06/15/2023]
Abstract
Multi-state models are frequently used when data come from subjects observed over time and where focus is on the occurrence of events that the subjects may experience. A convenient modeling assumption is that the multi-state stochastic process is Markovian, in which case a number of methods are available when doing inference for both transition intensities and transition probabilities. The Markov assumption, however, is quite strict and may not fit actual data in a satisfactory way. Therefore, inference methods for non-Markov models are needed. In this paper, we review methods for estimating transition probabilities in such models and suggest ways of doing regression analysis based on pseudo observations. In particular, we will compare methods using land-marking with methods using plug-in. The methods are illustrated using simulations and practical examples from medical research.
Collapse
Affiliation(s)
- Per Kragh Andersen
- Section of Biostatistics, University of Copenhagen, Øster Farimagsgade 5, PB 2099, 1014, Copenhagen K, Denmark.
| | - Eva Nina Sparre Wandall
- Section of Biostatistics, University of Copenhagen, Øster Farimagsgade 5, PB 2099, 1014, Copenhagen K, Denmark
| | - Maja Pohar Perme
- Institute for Biostatistics and Medical Informatics, Faculty of Medicine, University of Ljubljana, Vrazov Trg 2, 1000, Ljubljana, Slovenia
| |
Collapse
|
3
|
Cook K, Perkins NJ, Schisterman E, Haneuse S. A multistate competing risks framework for preconception prediction of pregnancy outcomes. BMC Med Res Methodol 2022; 22:156. [PMID: 35637547 PMCID: PMC9150288 DOI: 10.1186/s12874-022-01589-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Accepted: 03/10/2022] [Indexed: 11/21/2022] Open
Abstract
Background Preconception pregnancy risk profiles—characterizing the likelihood that a pregnancy attempt results in a full-term birth, preterm birth, clinical pregnancy loss, or failure to conceive—can provide critical information during the early stages of a pregnancy attempt, when obstetricians are best positioned to intervene to improve the chances of successful conception and full-term live birth. Yet the task of constructing and validating risk assessment tools for this earlier intervention window is complicated by several statistical features: the final outcome of the pregnancy attempt is multinomial in nature, and it summarizes the results of two intermediate stages, conception and gestation, whose outcomes are subject to competing risks, measured on different time scales, and governed by different biological processes. In light of this complexity, existing pregnancy risk assessment tools largely focus on predicting a single adverse pregnancy outcome, and make these predictions at some later, post-conception time point. Methods We reframe the individual pregnancy attempt as a multistate model comprised of two nested multinomial prediction tasks: one corresponding to conception and the other to the subsequent outcome of that pregnancy. We discuss the estimation of this model in the presence of multiple stages of outcome missingness and then introduce an inverse-probability-weighted Hypervolume Under the Manifold statistic to validate the resulting multivariate risk scores. Finally, we use data from the Effects of Aspirin in Gestation and Reproduction (EAGeR) trial to illustrate how this multistate competing risks framework might be utilized in practice to construct and validate a preconception pregnancy risk assessment tool. Results In the EAGeR study population, the resulting risk profiles are able to meaningfully discriminate between the four pregnancy attempt outcomes of interest and represent a significant improvement over classification by random chance. Conclusions As illustrated in our analysis of the EAGeR data, our proposed prediction framework expands the pregnancy risk assessment task in two key ways—by considering a broader array of pregnancy outcomes and by providing the predictions at an earlier, preconception intervention window—providing obstetricians and their patients with more information and opportunities to successfully guide pregnancy attempts.
Collapse
|
4
|
Sørensen JK, Framke E, Pedersen J, Alexanderson K, Bonde JP, Farrants K, Flachs EM, Magnusson Hanson LL, Nyberg ST, Kivimäki M, Madsen IEH, Rugulies R. Work stress and loss of years lived without chronic disease: an 18-year follow-up of 1.5 million employees in Denmark. Eur J Epidemiol 2022; 37:389-400. [PMID: 35312925 PMCID: PMC9187572 DOI: 10.1007/s10654-022-00852-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 02/07/2022] [Indexed: 11/24/2022]
Abstract
We aimed to examine the association between exposure to work stress and chronic disease incidence and loss of chronic disease-free life years in the Danish workforce. The study population included 1,592,491 employees, aged 30-59 in 2000 and without prevalent chronic diseases. We assessed work stress as the combination of job strain and effort-reward imbalance using job exposure matrices. We used Cox regressions to estimate risk of incident hospital-diagnoses or death of chronic diseases (i.e., type 2 diabetes, coronary heart disease, stroke, cancer, asthma, chronic obstructive pulmonary disease, heart failure, and dementia) during 18 years of follow-up and calculated corresponding chronic disease-free life expectancy from age 30 to age 75. Individuals working in occupations with high prevalence of work stress had a higher risk of incident chronic disease compared to those in occupations with low prevalence of work stress (women: HR 1.04 (95% CI 1.02-1.05), men: HR 1.12 (95% CI 1.11-1.14)). The corresponding loss in chronic disease-free life expectancy was 0.25 (95% CI - 0.10 to 0.60) and 0.84 (95% CI 0.56-1.11) years in women and men, respectively. Additional adjustment for health behaviours attenuated these associations among men. We conclude that men working in high-stress occupations have a small loss of years lived without chronic disease compared to men working in low-stress occupations. This finding appeared to be partially attributable to harmful health behaviours. In women, high work stress indicated a very small and statistically non-significant loss of years lived without chronic disease.
Collapse
Affiliation(s)
- Jeppe K Sørensen
- National Research Centre for the Working Environment, Lersø Parkalle 105, 2100, Copenhagen, Denmark.
| | - Elisabeth Framke
- National Research Centre for the Working Environment, Lersø Parkalle 105, 2100, Copenhagen, Denmark.,The Danish Multiple Sclerosis Registry, Copenhagen University Hospital, Blegdamsvej 9, 2100, Copenhagen, Denmark
| | - Jacob Pedersen
- National Research Centre for the Working Environment, Lersø Parkalle 105, 2100, Copenhagen, Denmark
| | - Kristina Alexanderson
- Division of Insurance Medicine, Department of Clinical Neuroscience, Karolinska Institutet, 171 77, Stockholm, Sweden
| | - Jens P Bonde
- Department of Occupational and Environmental Medicine, Bispebjerg and Frederiksberg Hospital, Bispebjerg Bakke 23F, 2400, Copenhagen, Frederiksberg, Denmark.,Department of Public Health, University of Copenhagen, Øster Farimagsgade 5, 1014, Copenhagen, Denmark
| | - Kristin Farrants
- Division of Insurance Medicine, Department of Clinical Neuroscience, Karolinska Institutet, 171 77, Stockholm, Sweden
| | - Esben M Flachs
- Department of Occupational and Environmental Medicine, Bispebjerg and Frederiksberg Hospital, Bispebjerg Bakke 23F, 2400, Copenhagen, Frederiksberg, Denmark
| | - Linda L Magnusson Hanson
- Stress Research Institute at Department of Psychology, Stockholm University, Frescati Hagväg 16A, 114 19, Stockholm, Sweden
| | - Solja T Nyberg
- Department of Public Health, Clinicum, Faculty of Medicine, University of Helsinki, Yliopistonkatu 3, 00014, Helsinki, Finland
| | - Mika Kivimäki
- Department of Public Health, Clinicum, Faculty of Medicine, University of Helsinki, Yliopistonkatu 3, 00014, Helsinki, Finland.,Finnish Institute of Occupational Health, Topeliuksenkatu 41 B, 00250, Helsinki, Finland.,Department of Epidemiology and Public Health, University College of London, 1-19 Torrington Place, London, WC1E 6BT, UK
| | - Ida E H Madsen
- National Research Centre for the Working Environment, Lersø Parkalle 105, 2100, Copenhagen, Denmark
| | - Reiner Rugulies
- National Research Centre for the Working Environment, Lersø Parkalle 105, 2100, Copenhagen, Denmark.,Department of Public Health, University of Copenhagen, Øster Farimagsgade 5, 1014, Copenhagen, Denmark.,Department of Psychology, University of Copenhagen, Øster Farimagsgade 2A, 1353, Copenhagen, Denmark
| |
Collapse
|
5
|
Expected Labor Market Affiliation: A New Method Illustrated by Estimating the Impact of Perceived Stress on Time in Work, Sickness Absence and Unemployment of 37,605 Danish Employees. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18094980. [PMID: 34067104 PMCID: PMC8124718 DOI: 10.3390/ijerph18094980] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 05/05/2021] [Accepted: 05/05/2021] [Indexed: 11/29/2022]
Abstract
As detailed data on labor market affiliation become more accessible, new approaches are needed to address the complex patterns of labor market affiliation. We introduce the expected labor market affiliation (ELMA) method by estimating the time-restricted impact of perceived stress on labor market affiliation in a large sample of Danish employees. Data from two national surveys were linked with a national register. A multi-state proportional hazards model was used to calculate ELMA estimates, i.e., the number of days in work, sickness absence, and unemployment during a 4-year follow-up period, stratified by gender and age. Among employees reporting frequent work-related stress, the expected number of working days decreased with age, ranging from 103 days lost among older women to 37 days lost among younger and middle-aged men. Young and middle-aged women reporting frequent work- and personal life-related stress lost 62 and 81 working days, respectively, and had more days of sickness absence (34 days and 42 days). In conclusion, we showed that perceived stress affects the labor market affiliation. The ELMA estimates provide a detailed understanding of the impact of perceived stress on labor market affiliation over time, and may inform policy and practice towards a more healthy and sustainable working life.
Collapse
|
6
|
Zhu Y, Chen Z, Lawless JF. Semiparametric analysis of interval‐censored failure time data with outcome‐dependent observation schemes. Scand Stat Theory Appl 2021. [DOI: 10.1111/sjos.12511] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- Yayuan Zhu
- Department of Epidemiology and Biostatistics University of Western Ontario London Ontario Canada
| | - Ziqi Chen
- Key Laboratory of Advanced Theory and Application in Statistics and Data Science‐MOE, School of Statistics East China Normal University Shanghai P.R. China
| | - Jerald F. Lawless
- Department of Statistics and Actuarial Science University of Waterloo Waterloo Ontario Canada
| |
Collapse
|
7
|
Bakoyannis G. Nonparametric analysis of nonhomogeneous multistate processes with clustered observations. Biometrics 2020; 77:533-546. [PMID: 32640037 PMCID: PMC7790918 DOI: 10.1111/biom.13327] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 06/14/2020] [Accepted: 06/24/2020] [Indexed: 12/21/2022]
Abstract
Frequently, clinical trials and observational studies involve complex event history data with multiple events. When the observations are independent, the analysis of such studies can be based on standard methods for multistate models. However, the independence assumption is often violated, such as in multicenter studies, which makes standard methods improper. This work addresses the issue of nonparametric estimation and two‐sample testing for the population‐averaged transition and state occupation probabilities under general multistate models with cluster‐correlated, right‐censored, and/or left‐truncated observations. The proposed methods do not impose assumptions regarding the within‐cluster dependence, allow for informative cluster size, and are applicable to both Markov and non‐Markov processes. Using empirical process theory, the estimators are shown to be uniformly consistent and to converge weakly to tight Gaussian processes. Closed‐form variance estimators are derived, rigorous methodology for the calculation of simultaneous confidence bands is proposed, and the asymptotic properties of the nonparametric tests are established. Furthermore, I provide theoretical arguments for the validity of the nonparametric cluster bootstrap, which can be readily implemented in practice regardless of how complex the underlying multistate model is. Simulation studies show that the performance of the proposed methods is good, and that methods that ignore the within‐cluster dependence can lead to invalid inferences. Finally, the methods are illustrated using data from a multicenter randomized controlled trial.
Collapse
|
8
|
Zhan T, Schaubel DE. Semiparametric regression methods for temporal processes subject to multiple sources of censoring. CAN J STAT 2019. [DOI: 10.1002/cjs.11528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Tianyu Zhan
- Department of BiostatisticsUniversity of Michigan 1415 Washington Heights Ann Arbor MI 48109 U.S.A
| | - Douglas E. Schaubel
- Department of Biostatistics, Epidemiology, and InformaticsUniversity of Pennsylvania 423 Guardian Drive Philadelphia PA 19104 U.S.A
| |
Collapse
|
9
|
Lawless JF, Cook RJ. A new perspective on loss to follow‐up in failure time and life history studies. Stat Med 2019; 38:4583-4610. [DOI: 10.1002/sim.8318] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Revised: 05/07/2019] [Accepted: 06/20/2019] [Indexed: 11/11/2022]
Affiliation(s)
- Jerald F. Lawless
- Department of Statistics and Actuarial ScienceUniversity of Waterloo Waterloo Ontario Canada
| | - Richard J. Cook
- Department of Statistics and Actuarial ScienceUniversity of Waterloo Waterloo Ontario Canada
| |
Collapse
|
10
|
Zhan T, Schaubel DE. Semiparametric temporal process regression of survival-out-of-hospital. LIFETIME DATA ANALYSIS 2019; 25:322-340. [PMID: 29796979 PMCID: PMC6251773 DOI: 10.1007/s10985-018-9433-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Accepted: 05/09/2018] [Indexed: 06/08/2023]
Abstract
The recurrent/terminal event data structure has undergone considerable methodological development in the last 10-15 years. An example of the data structure that has arisen with increasing frequency involves the recurrent event being hospitalization and the terminal event being death. We consider the response Survival-Out-of-Hospital, defined as a temporal process (indicator function) taking the value 1 when the subject is currently alive and not hospitalized, and 0 otherwise. Survival-Out-of-Hospital is a useful alternative strategy for the analysis of hospitalization/survival in the chronic disease setting, with the response variate representing a refinement to survival time through the incorporation of an objective quality-of-life component. The semiparametric model we consider assumes multiplicative covariate effects and leaves unspecified the baseline probability of being alive-and-out-of-hospital. Using zero-mean estimating equations, the proposed regression parameter estimator can be computed without estimating the unspecified baseline probability process, although baseline probabilities can subsequently be estimated for any time point within the support of the censoring distribution. We demonstrate that the regression parameter estimator is asymptotically normal, and that the baseline probability function estimator converges to a Gaussian process. Simulation studies are performed to show that our estimating procedures have satisfactory finite sample performances. The proposed methods are applied to the Dialysis Outcomes and Practice Patterns Study (DOPPS), an international end-stage renal disease study.
Collapse
Affiliation(s)
- Tianyu Zhan
- Department of Biostatistics, University of Michigan, 1415 Washington Hts., Ann Arbor, MI, 48109-2029, USA
| | - Douglas E Schaubel
- Department of Biostatistics, University of Michigan, 1415 Washington Hts., Ann Arbor, MI, 48109-2029, USA.
| |
Collapse
|
11
|
Siriwardhana C, Kulasekera KB, Datta S. Flexible semi-parametric regression of state occupational probabilities in a multistate model with right-censored data. LIFETIME DATA ANALYSIS 2018; 24:464-491. [PMID: 28819787 PMCID: PMC5816729 DOI: 10.1007/s10985-017-9403-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2016] [Accepted: 07/23/2017] [Indexed: 06/07/2023]
Abstract
Inference for the state occupation probabilities, given a set of baseline covariates, is an important problem in survival analysis and time to event multistate data. We introduce an inverse censoring probability re-weighted semi-parametric single index model based approach to estimate conditional state occupation probabilities of a given individual in a multistate model under right-censoring. Besides obtaining a temporal regression function, we also test the potential time varying effect of a baseline covariate on future state occupation. We show that the proposed technique has desirable finite sample performances and its performance is competitive when compared with three other existing approaches. We illustrate the proposed methodology using two different data sets. First, we re-examine a well-known data set dealing with leukemia patients undergoing bone marrow transplant with various state transitions. Our second illustration is based on data from a study involving functional status of a set of spinal cord injured patients undergoing a rehabilitation program.
Collapse
Affiliation(s)
- Chathura Siriwardhana
- Department of Complementary and Integrative Medicine, John A. Burns School of Medicine, University of Hawaii, Honolulu, HI, USA
| | - K B Kulasekera
- Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY, USA
| | - Somnath Datta
- Department of Biostatistics, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
12
|
Azarang L, Scheike T, de Uña-Álvarez J. Direct modeling of regression effects for transition probabilities in the progressive illness-death model. Stat Med 2017; 36:1964-1976. [PMID: 28238225 DOI: 10.1002/sim.7245] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Revised: 01/05/2017] [Accepted: 01/18/2017] [Indexed: 11/11/2022]
Abstract
In this work, we present direct regression analysis for the transition probabilities in the possibly non-Markov progressive illness-death model. The method is based on binomial regression, where the response is the indicator of the occupancy for the given state along time. Randomly weighted score equations that are able to remove the bias due to censoring are introduced. By solving these equations, one can estimate the possibly time-varying regression coefficients, which have an immediate interpretation as covariate effects on the transition probabilities. The performance of the proposed estimator is investigated through simulations. We apply the method to data from the Registry of Systematic Lupus Erythematosus RELESSER, a multicenter registry created by the Spanish Society of Rheumatology. Specifically, we investigate the effect of age at Lupus diagnosis, sex, and ethnicity on the probability of damage and death along time. Copyright © 2017 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Leyla Azarang
- Aix-Marseille Univ, INSERM, IRD, SESSTIM, Sciences Economiques et Sociales de la Santé et Traitement de l'Information Médicale, Marseille, France
| | - Thomas Scheike
- Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Jacobo de Uña-Álvarez
- Statistical Inference, Decision and Operations Research (SiDOR) Group, University of Vigo, Spain.,Department of Statistics and Operations Research Centre for Biomedical Research (CINBIO), University of Vigo, Spain
| |
Collapse
|
13
|
Dutta S, Datta S, Datta S. Temporal Prediction of Future State Occupation in a Multistate Model from High-Dimensional Baseline Covariates via Pseudo-Value Regression. J STAT COMPUT SIM 2016; 87:1363-1378. [PMID: 29217870 DOI: 10.1080/00949655.2016.1263992] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
In many complex diseases such as cancer, a patient undergoes various disease stages before reaching a terminal state (say disease free or death). This fits a multistate model framework where a prognosis may be equivalent to predicting the state occupation at a future time t. With the advent of high throughput genomic and proteomic assays, a clinician may intent to use such high dimensional covariates in making better prediction of state occupation. In this article, we offer a practical solution to this problem by combining a useful technique, called pseudo value regression, with a latent factor or a penalized regression method such as the partial least squares (PLS) or the least absolute shrinkage and selection operator (LASSO), or their variants. We explore the predictive performances of these combinations in various high dimensional settings via extensive simulation studies. Overall, this strategy works fairly well provided the models are tuned properly. Overall, the PLS turns out to be slightly better than LASSO in most settings investigated by us, for the purpose of temporal prediction of future state occupation. We illustrate the utility of these pseudo-value based high dimensional regression methods using a lung cancer data set where we use the patients' baseline gene expression values.
Collapse
Affiliation(s)
- Sandipan Dutta
- Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY, USA
| | - Susmita Datta
- Department of Biostatistics, University of Florida, Gainesville, FL, USA
| | - Somnath Datta
- Department of Biostatistics, University of Florida, Gainesville, FL, USA
| |
Collapse
|
14
|
Han S, Andrei AC, Tsui KW. A flexible semiparametric modeling approach for doubly censored data with an application to prostate cancer. Stat Methods Med Res 2016; 25:1718-35. [PMID: 23907782 PMCID: PMC8380435 DOI: 10.1177/0962280213498325] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Doubly censored data often arise in medical studies of disease progression involving two related events for which both an originating and a terminating event are interval-censored. Although regression modeling for such doubly censored data may be complicated, we propose a simple semiparametric regression modeling strategy based on jackknife pseudo-observations obtained using nonparametric estimators of the survival function. Inference is carried out via generalized estimating equations. Simulations studies show that the proposed method produces virtually unbiased covariate effect estimates, even for moderate sample sizes. A prostate cancer study example illustrates the practical advantages of the proposed approach.
Collapse
Affiliation(s)
- Seungbong Han
- Department of Clinical Epidemiology and Biostatistics, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea
| | - Adin-Cristian Andrei
- BCVI Clinical Trials Unit, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Kam-Wah Tsui
- Department of Statistics, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
15
|
Ambrogi F, Scheike TH. Penalized estimation for competing risks regression with applications to high-dimensional covariates. Biostatistics 2016; 17:708-21. [DOI: 10.1093/biostatistics/kxw017] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2015] [Accepted: 03/12/2016] [Indexed: 11/12/2022] Open
Abstract
High-dimensional regression has become an increasingly important topic for many research fields. For example, biomedical research generates an increasing amount of data to characterize patients' bio-profiles (e.g. from a genomic high-throughput assay). The increasing complexity in the characterization of patients' bio-profiles is added to the complexity related to the prolonged follow-up of patients with the registration of the occurrence of possible adverse events. This information may offer useful insight into disease dynamics and in identifying subset of patients with worse prognosis and better response to the therapy. Although in the last years the number of contributions for coping with high and ultra-high-dimensional data in standard survival analysis have increased (Witten and Tibshirani, 2010. Survival analysis with high-dimensional covariates. Statistical Methods in Medical Research19(1), 29–51), the research regarding competing risks is less developed (Binder and others, 2009. Boosting for high-dimensional time-to-event data with competing risks. Bioinformatics25(7), 890–896). The aim of this work is to consider how to do penalized regression in the presence of competing events. The direct binomial regression model of Scheike and others (2008. Predicting cumulative incidence probability by direct binomial regression. Biometrika95(1), 205–220) is reformulated in a penalized framework to possibly fit a sparse regression model. The developed approach is easily implementable using existing high-performance software to do penalized regression. Results from simulation studies are presented together with an application to genomic data when the endpoint is progression-free survival. An R function is provided to perform regularized competing risks regression according to the binomial model in the package timereg (Scheike and Martinussen, 2006. Dynamic Regression models for survival data. New York: Springer), available through CRAN.
Collapse
|
16
|
Eriksson F, Li J, Scheike T, Zhang MJ. The proportional odds cumulative incidence model for competing risks. Biometrics 2015; 71:687-95. [PMID: 26013050 DOI: 10.1111/biom.12330] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2014] [Revised: 03/01/2015] [Accepted: 03/01/2015] [Indexed: 11/30/2022]
Abstract
We suggest an estimator for the proportional odds cumulative incidence model for competing risks data. The key advantage of this model is that the regression parameters have the simple and useful odds ratio interpretation. The model has been considered by many authors, but it is rarely used in practice due to the lack of reliable estimation procedures. We suggest such procedures and show that their performance improve considerably on existing methods. We also suggest a goodness-of-fit test for the proportional odds assumption. We derive the large sample properties and provide estimators of the asymptotic variance. The method is illustrated by an application in a bone marrow transplant study and the finite-sample properties are assessed by simulations.
Collapse
Affiliation(s)
- Frank Eriksson
- Section of Biostatistics, University of Copenhagen, Øster Farimagsgade 5, Copenhagen DK-1014, Denmark
| | - Jianing Li
- Division of Biostatistics, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, Wisconsin 53226, U.S.A
| | - Thomas Scheike
- Section of Biostatistics, University of Copenhagen, Øster Farimagsgade 5, Copenhagen DK-1014, Denmark
| | - Mei-Jie Zhang
- Division of Biostatistics, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, Wisconsin 53226, U.S.A
| |
Collapse
|
17
|
van Houwelingen HC, Putter H. Comparison of stopped Cox regression with direct methods such as pseudo-values and binomial regression. LIFETIME DATA ANALYSIS 2015; 21:180-196. [PMID: 25084763 DOI: 10.1007/s10985-014-9299-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2014] [Accepted: 07/09/2014] [Indexed: 06/03/2023]
Abstract
By far the most popular model to obtain survival predictions for individual patients is the Cox model. The Cox model does not make any assumptions on the underlying hazard, but it relies heavily on the proportional hazards assumption. The most common ways to circumvent this robustness problem are 1) to categorize patients based on their prognostic risk score and to base predictions on Kaplan-Meier curves for the risk categories, or 2) to include interactions with the covariates and suitable functions of time. Robust estimators of the t(0)-year survival probabilities can also be obtained from a "stopped Cox" regression model, in which all observations are administratively censored at t(0). Other recent approaches to solve this robustness problem, originally proposed in the context of competing risks, are pseudo-values and direct binomial regression, based on unbiased estimating equations. In this paper stopped Cox regression is compared with these direct approaches. This is done by means of a simulation study to assess the biases of the different approaches and an analysis of breast cancer data to get some feeling for the performance in practice. The tentative conclusion is that stopped Cox and direct models agree well if the follow-up is not too long. There are larger differences for long-term follow-up data. There stopped Cox might be more efficient, but less robust.
Collapse
Affiliation(s)
- Hans C van Houwelingen
- Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Postzone S-5-P, PO Box 9600, 2300 RC , Leiden, The Netherlands
| | | |
Collapse
|
18
|
Varying coefficient subdistribution regression for left-truncated semi-competing risks data. J MULTIVARIATE ANAL 2014; 131:65-78. [PMID: 25125711 DOI: 10.1016/j.jmva.2014.06.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Semi-competing risks data frequently arise in biomedical studies when time to a disease landmark event is subject to dependent censoring by death, the observation of which however is not precluded by the occurrence of the landmark event. In observational studies, the analysis of such data can be further complicated by left truncation. In this work, we study a varying co-efficient subdistribution regression model for left-truncated semi-competing risks data. Our method appropriately accounts for the specifical truncation and censoring features of the data, and moreover has the flexibility to accommodate potentially varying covariate effects. The proposed method can be easily implemented and the resulting estimators are shown to have nice asymptotic properties. We also present inference, such as Kolmogorov-Smirnov type and Cramér Von-Mises type hypothesis testing procedures for the covariate effects. Simulation studies and an application to the Denmark diabetes registry demonstrate good finite-sample performance and practical utility of the proposed method.
Collapse
|
19
|
van Houwelingen HC. From model building to validation and back: a plea for robustness. Stat Med 2014; 33:5223-38. [DOI: 10.1002/sim.6282] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2014] [Revised: 07/15/2014] [Accepted: 07/15/2014] [Indexed: 11/11/2022]
Affiliation(s)
- Hans C. van Houwelingen
- Medical Statistics and Bioinformatics; Leiden University Medical Centre; Leiden The Netherlands
| |
Collapse
|
20
|
Han S, Andrei AC, Tsui KW. A Semiparametric Regression Method for Interval-Censored Data. COMMUN STAT-SIMUL C 2013. [DOI: 10.1080/03610918.2012.697962] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
21
|
Logan BR. Review of multistate models in hematopoietic cell transplantation studies. Biol Blood Marrow Transplant 2012; 19:S84-7. [PMID: 23084957 DOI: 10.1016/j.bbmt.2012.10.016] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Brent R Logan
- Division of Biostatistics and Center for International Blood and Marrow Transplant Research, Medical College of Wisconsin, Milwaukee, Wisconsin 532260509, USA.
| |
Collapse
|
22
|
Bartolomeo N, Trerotoli P, Serio G. Progression of liver cirrhosis to HCC: an application of hidden Markov model. BMC Med Res Methodol 2011; 11:38. [PMID: 21457586 PMCID: PMC3087702 DOI: 10.1186/1471-2288-11-38] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2010] [Accepted: 04/04/2011] [Indexed: 01/05/2023] Open
Abstract
Background Health service databases of administrative type can be a useful tool for the study of progression of a disease, but the data reported in such sources could be affected by misclassifications of some patients' real disease states at the time. Aim of this work was to estimate the transition probabilities through the different degenerative phases of liver cirrhosis using health service databases. Methods We employed a hidden Markov model to determine the transition probabilities between two states, and of misclassification. The covariates inserted in the model were sex, age, the presence of comorbidities correlated with alcohol abuse, the presence of diagnosis codes indicating hepatitis C virus infection, and the Charlson Index. The analysis was conducted in patients presumed to have suffered the onset of cirrhosis in 2000, observing the disease evolution and, if applicable, death up to the end of the year 2006. Results The incidence of hepatocellular carcinoma (HCC) in cirrhotic patients was 1.5% per year. The probability of developing HCC is higher in males (OR = 2.217) and patients over 65 (OR = 1.547); over 65-year-olds have a greater probability of death both while still suffering from cirrhosis (OR = 2.379) and if they have developed HCC (OR = 1.410). A more severe casemix affects the transition from HCC to death (OR = 1.714). The probability of misclassifying subjects with HCC as exclusively affected by liver cirrhosis is 14.08%. Conclusions The hidden Markov model allowing for misclassification is well suited to analyses of health service databases, since it is able to capture bias due to the fact that the quality and accuracy of the available information are not always optimal. The probability of evolution of a cirrhotic subject to HCC depends on sex and age class, while hepatitis C virus infection and comorbidities correlated with alcohol abuse do not seem to have an influence.
Collapse
Affiliation(s)
- Nicola Bartolomeo
- Department of Biomedical Science and Human Oncology, Chair of Medical Statistics, University of Bari, Bari, Italy.
| | | | | |
Collapse
|
23
|
Allignol A, Schumacher M, Beyersmann J. Estimating summary functionals in multistate models with an application to hospital infection data. Comput Stat 2010. [DOI: 10.1007/s00180-010-0200-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
24
|
|
25
|
Graw F, Gerds TA, Schumacher M. On pseudo-values for regression analysis in competing risks models. LIFETIME DATA ANALYSIS 2009; 15:241-255. [PMID: 19051013 DOI: 10.1007/s10985-008-9107-z] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2008] [Accepted: 11/05/2008] [Indexed: 05/27/2023]
Abstract
For regression on state and transition probabilities in multi-state models Andersen et al. (Biometrika 90:15-27, 2003) propose a technique based on jackknife pseudo-values. In this article we analyze the pseudo-values suggested for competing risks models and prove some conjectures regarding their asymptotics (Klein and Andersen, Biometrics 61:223-229, 2005). The key is a second order von Mises expansion of the Aalen-Johansen estimator which yields an appropriate representation of the pseudo-values. The method is illustrated with data from a clinical study on total joint replacement. In the application we consider for comparison the estimates obtained with the Fine and Gray approach (J Am Stat Assoc 94:496-509, 1999) and also time-dependent solutions of pseudo-value regression equations.
Collapse
Affiliation(s)
- Frederik Graw
- Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland.
| | | | | |
Collapse
|
26
|
van Houwelingen HC, Putter H. Dynamic predicting by landmarking as an alternative for multi-state modeling: an application to acute lymphoid leukemia data. LIFETIME DATA ANALYSIS 2008; 14:447-463. [PMID: 18836831 PMCID: PMC2798037 DOI: 10.1007/s10985-008-9099-8] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2008] [Accepted: 09/10/2008] [Indexed: 05/26/2023]
Abstract
This paper considers the problem of obtaining a dynamic prediction for 5-year failure free survival after bone marrow transplantation in ALL patients using data from the EBMT, the European Group for Blood and Marrow Transplantation. The paper compares the new landmark methodology as developed by the first author and the established multi-state modeling as described in a recent Tutorial in Biostatistics in Statistics in Medicine by the second author and colleagues. As expected the two approaches give similar results. The landmark methodology does not need complex modeling and leads to easy prediction rules. On the other hand, it does not give the insight in the biological processes as obtained for the multi-state model.
Collapse
Affiliation(s)
- Hans C. van Houwelingen
- Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Post Zone S5-P, P.O. Box 9600, Leiden, 2300 RC The Netherlands
| | - Hein Putter
- Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Post Zone S5-P, P.O. Box 9600, Leiden, 2300 RC The Netherlands
| |
Collapse
|
27
|
Andersen PK, Pohar Perme M. Inference for outcome probabilities in multi-state models. LIFETIME DATA ANALYSIS 2008; 14:405-31. [PMID: 18791824 PMCID: PMC2735091 DOI: 10.1007/s10985-008-9097-x] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2008] [Accepted: 08/12/2008] [Indexed: 05/26/2023]
Abstract
In bone marrow transplantation studies, patients are followed over time and a number of events may be observed. These include both ultimate events like death and relapse and transient events like graft versus host disease and graft recovery. Such studies, therefore, lend themselves for using an analytic approach based on multi-state models. We will give a review of such methods with emphasis on regression models for both transition intensities and transition- and state occupation probabilities. Both semi-parametric models, like the Cox regression model, and parametric models based on piecewise constant intensities will be discussed.
Collapse
Affiliation(s)
- Per Kragh Andersen
- Department of Biostatistics, University of Copenhagen, O. Farimagsgade 5, PB 2099, 1014, Copenhagen K, Denmark.
| | | |
Collapse
|
28
|
Fiocco M, Putter H, van Houwelingen HC. Reduced-rank proportional hazards regression and simulation-based prediction for multi-state models. Stat Med 2008; 27:4340-58. [DOI: 10.1002/sim.3305] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
29
|
Bartolomeo N, Trerotoli P, Moretti A, Serio G. A Markov model to evaluate hospital readmission. BMC Med Res Methodol 2008; 8:23. [PMID: 18430214 PMCID: PMC2386136 DOI: 10.1186/1471-2288-8-23] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2008] [Accepted: 04/22/2008] [Indexed: 11/25/2022] Open
Abstract
Background The analysis of non-fatal recurring events is frequently found in studies on chronic-degenerative diseases. The aim of this paper is to estimate the probability of readmission of patients with Chronic Obstructive Pulmonary Disease (COPD) or with Respiratory Failure (RF). Methods The Repeated hospital admissions of a patient are considered as a Markov Chain. The transitions between the states are estimated using the Nelson-Aalen estimator. The analysis was carried out using the Puglia Region hospital patient discharge database for the years 1998–2005. Patients were selected on the basis of first admission between 01/01/2001 and 31/12/2005 with ICD-9-CM code of COPD or RF as principal and/or secondary diagnosis. For those selected two possible transitions were considered in the case they had the second and third admission with an ICD-9-CM code of COPD or RF as principal diagnosis. Results The probability of readmission is increased in patients with a diagnosis of RF (OR = 1.618 in the first transition and 1.279 in the second) and also in those with a diagnosis of COPD or RF as the principal diagnosis at first admission (OR = 1.615 in the first transition and 1.193 in the second). The clinical gravity and the ward from which they were discharged did not significantly influence the probability of readmission. Conclusion The time to readmission depends on the gravity of the pathology at onset. In patients with a grave clinical picture, either COPD or Respiratory Failure, when treated and controlled after the first admission, they become minor problems and they are indicated among secondary diagnoses in any further admission.
Collapse
Affiliation(s)
- Nicola Bartolomeo
- Department of Biomedical Science and Human Oncology, Chair of Medical Statistics, University of Bari, Italy.
| | | | | | | |
Collapse
|
30
|
Beyersmann J, Schumacher M. Time-dependent covariates in the proportional subdistribution hazards model for competing risks. Biostatistics 2008; 9:765-76. [PMID: 18434297 DOI: 10.1093/biostatistics/kxn009] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Separate Cox analyses of all cause-specific hazards are the standard technique of choice to study the effect of a covariate in competing risks, but a synopsis of these results in terms of cumulative event probabilities is challenging. This difficulty has led to the development of the proportional subdistribution hazards model. If the covariate is known at baseline, the model allows for a summarizing assessment in terms of the cumulative incidence function. black Mathematically, the model also allows for including random time-dependent covariates, but practical implementation has remained unclear due to a certain risk set peculiarity. We use the intimate relationship of discrete covariates and multistate models to naturally treat time-dependent covariates within the subdistribution hazards framework. The methodology then straightforwardly translates to real-valued time-dependent covariates. As with classical survival analysis, including time-dependent covariates does not result in a model for probability functions anymore. Nevertheless, the proposed methodology provides a useful synthesis of separate cause-specific hazards analyses. We illustrate this with hospital infection data, where time-dependent covariates and competing risks are essential to the subject research question.
Collapse
Affiliation(s)
- Jan Beyersmann
- Freiburg Centre for Data Analysis and Modelling, University of Freiburg, Freiburg, Germany.
| | | |
Collapse
|