1
|
Gleiss A, Gnant M, Schemper M. Explained variation and degrees of necessity and of sufficiency for competing risks survival data. Biom J 2024; 66:e2300140. [PMID: 38409618 DOI: 10.1002/bimj.202300140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 11/15/2023] [Accepted: 12/08/2023] [Indexed: 02/28/2024]
Abstract
In this contribution, the Schemper-Henderson measure of explained variation for survival outcomes is extended to accommodate competing events (CEs) in addition to events of interest. The extension is achieved by moving from the unconditional and conditional survival functions of the original measure to unconditional and conditional cumulative incidence functions, the latter obtained, for example, from Fine and Gray models. In the absence of CEs, the original measure is obtained as a special case. We define explained variation on the population level and provide two different types of estimates. Recently, the authors have achieved a multiplicative decomposition of explained variation into degrees of necessity and degrees of sufficiency. These measures are also extended to the case of competing risks survival data. A SAS macro and an R function are provided to facilitate application. Interesting empirical properties of the measures are explored on the population level and by an extensive simulation study. Advantages of the approach are exemplified by an Austrian study of breast cancer with a high proportion of CEs.
Collapse
Affiliation(s)
- Andreas Gleiss
- Center for Medical Data Science, Institute of Clinical Biometrics, Medical University of Vienna, Vienna, Austria
| | - Michael Gnant
- Comprehensive Cancer Center, Medical University of Vienna, Vienna, Austria
| | - Michael Schemper
- Center for Medical Data Science, Institute of Clinical Biometrics, Medical University of Vienna, Vienna, Austria
| |
Collapse
|
2
|
Chhoa H, Chabriat H, Chevret S, Biard L. Comparison of models for stroke-free survival prediction in patients with CADASIL. Sci Rep 2023; 13:22443. [PMID: 38105268 PMCID: PMC10725863 DOI: 10.1038/s41598-023-49552-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 12/09/2023] [Indexed: 12/19/2023] Open
Abstract
Cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy, which is caused by mutations of the NOTCH3 gene, has a large heterogeneous progression, presenting with declines of various clinical scores and occurrences of various clinical event. To help assess disease progression, this work focused on predicting the composite endpoint of stroke-free survival time by comparing the performance of Cox proportional hazards regression to that of machine learning models using one of four feature selection approaches applied to demographic, clinical and magnetic resonance imaging observational data collected from a study cohort of 482 patients. The quality of the modeling process and the predictive performance were evaluated in a nested cross-validation procedure using the time-dependent Brier Score and AUC at 5 years from baseline, the former measuring the overall performance including calibration and the latter highlighting the discrimination ability, with both metrics taking into account the presence of right-censoring. The best model for each metric was the componentwise gradient boosting model with a mean Brier score of 0.165 and the random survival forest model with a mean AUC of 0.773, both combined with the LASSO feature selection method.
Collapse
Affiliation(s)
- Henri Chhoa
- ECSTRRA Team, Université Paris Cité, UMR1153, INSERM, Paris, France
| | - Hugues Chabriat
- Centre NeuroVasculaire Translationnel - Centre de Référence CERVCO, DMU NeuroSciences, Hôpital Lariboisière, GHU APHP-Nord, Université Paris Cité, Paris, France
- INSERM NeuroDiderot UMR 1141, GenMedStroke Team, Paris, France
| | - Sylvie Chevret
- ECSTRRA Team, Université Paris Cité, UMR1153, INSERM, Paris, France
| | - Lucie Biard
- ECSTRRA Team, Université Paris Cité, UMR1153, INSERM, Paris, France.
| |
Collapse
|
3
|
Zou Y, Yue M, Jia L, Wang Y, Chen H, Zhang A, Xia X, Liu W, Yu R, Yang S, Huang P. Accurate prediction of HCC risk after SVR in patients with hepatitis C cirrhosis based on longitudinal data. BMC Cancer 2023; 23:1147. [PMID: 38007418 PMCID: PMC10676612 DOI: 10.1186/s12885-023-11628-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 11/09/2023] [Indexed: 11/27/2023] Open
Abstract
BACKGROUND Most existing predictive models of hepatocellular carcinoma (HCC) risk after sustained virologic response (SVR) are built on data collected at baseline and therefore have limited accuracy. The current study aimed to construct an accurate predictive model incorporating longitudinal data using a novel modeling strategy. The predictive performance of the longitudinal model was also compared with a baseline model. METHODS A total of 400 patients with HCV-related cirrhosis who achieved SVR with direct-acting antivirals (DAA) were enrolled in the study. Patients were randomly divided into a training set (70%) and a validation set (30%). Informative features were extracted from the longitudinal variables and then put into the random survival forest (RSF) to develop the longitudinal model. A baseline model including the same variables was built for comparison. RESULTS During a median follow-up time of approximately 5 years, 25 patients (8.9%) in the training set and 11 patients (9.2%) in the validation set developed HCC. The areas under the receiver-operating characteristics curves (AUROC) for the longitudinal model were 0.9507 (0.8838-0.9997), 0.8767 (0.6972,0.9918), and 0.8307 (0.6941,0.9993) for 1-, 2- and 3-year risk prediction, respectively. The brier scores of the longitudinal model were also relatively low for the 1-, 2- and 3-year risk prediction (0.0283, 0.0561, and 0.0501, respectively). In contrast, the baseline model only achieved mediocre AUROCs of around 0.6 (0.6113, 0.6213, and 0.6480, respectively). CONCLUSIONS Our longitudinal model yielded accurate predictions of HCC risk in patients with HCV-relate cirrhosis, outperforming the baseline model. Our model can provide patients with valuable prognosis information and guide the intensity of surveillance in clinical practice.
Collapse
Affiliation(s)
- Yanzheng Zou
- Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, China
| | - Ming Yue
- Department of Infectious Diseases, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Linna Jia
- Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, China
| | - Yifan Wang
- Department of Infectious Disease, Jurong Hospital Affiliated to Jiangsu University, Jurong, China
| | - Hongbo Chen
- Department of Infectious Disease, Jurong Hospital Affiliated to Jiangsu University, Jurong, China
| | - Amei Zhang
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Yunnan, China
| | - Xueshan Xia
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Yunnan, China
- Kunming Medical University, Kunming, China
| | - Wei Liu
- Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, China
- Beijing Institute of Microbiology and Epidemiology, State Key Laboratory of Pathogen and Biosecurity, Beijing, China
| | - Rongbin Yu
- Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, China.
| | - Sheng Yang
- Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, China.
| | - Peng Huang
- Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, China.
| |
Collapse
|
4
|
Ding M, Ning J, Li R. Evaluation of competing risks prediction models using polytomous discrimination index. CAN J STAT 2021; 49:731-753. [PMID: 34707327 DOI: 10.1002/cjs.11583] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
For competing risks data, it is often important to predict a patient's outcome status at a clinically meaningful time point after incorporating the informative censoring due to competing risks. This can be done by adopting a regression model that relates the cumulative incidence probabilities to a set of covariates. To assess the performance of the resulting prediction tool, we propose an estimator of the polytomous discrimination index applicable to competing risks data, which can quantify a prognostic model's ability to discriminate among subjects from different outcome groups. The proposed estimator allows the prediction model to be subject to model misspecification and enjoys desirable asymptotic properties. We also develop an efficient computation algorithm that features a computational complexity of O(n log n). A perturbation resampling scheme is developed to achieve consistent variance estimation. Numerical results suggest that the estimator performs well under realistic sample sizes. We apply the proposed methods to a study of monoclonal gammopathy of undetermined significance.
Collapse
Affiliation(s)
- Maomao Ding
- Department of Statistics, Rice University, Houston, TX 77005, U.S.A
| | - Jing Ning
- Department of Biostatistics, the University of Texas MD Anderson Cancer Center, Houston, TX 77030, U.S.A
| | - Ruosha Li
- Department of Biostatistics and Data Science, the University of Texas Health Science Center at Houston, Houston, TX 77030, U.S.A
| |
Collapse
|
5
|
Shen F, Li L. Backward joint model and dynamic prediction of survival with multivariate longitudinal data. Stat Med 2021; 40:4395-4409. [PMID: 34018218 DOI: 10.1002/sim.9037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 04/21/2021] [Accepted: 05/01/2021] [Indexed: 11/05/2022]
Abstract
An important approach to dynamic prediction of time-to-event outcomes using longitudinal data is based on modeling the joint distribution of longitudinal and time-to-event data. The widely used joint model for this purpose is the shared random effect model. Presumably, adding more longitudinal predictors improves the predictive accuracy. However, the shared random effect model can be computationally difficult or prohibitive when a large number of longitudinal variables are used. In this paper, we study an alternative way of modeling the joint distribution of longitudinal and time-to-event data. Under this formulation, the log-likelihood involves no more than one-dimensional integration, regardless of the number of longitudinal variables in the model. Therefore, this model is particularly suitable in dynamic prediction problems with large number of longitudinal predictors. The model fitting can be implemented with tractable and stable computation by using a combination of pseudo maximum likelihood estimation, Expectation-Maximization algorithm, and convex optimization. We evaluate the proposed methodology and its predictive accuracy with varying number of longitudinal variables using simulations and data from a primary biliary cirrhosis study.
Collapse
Affiliation(s)
- Fan Shen
- Department of Biostatistics and Data Science, The University of Texas School of Public Health, Dallas, Texas, USA.,Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Liang Li
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| |
Collapse
|
6
|
Lin J, Li K, Luo S. Functional survival forests for multivariate longitudinal outcomes: Dynamic prediction of Alzheimer's disease progression. Stat Methods Med Res 2021; 30:99-111. [PMID: 32726189 PMCID: PMC7855476 DOI: 10.1177/0962280220941532] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The random survival forest (RSF) is a non-parametric alternative to the Cox proportional hazards model in modeling time-to-event data. In this article, we developed a modeling framework to incorporate multivariate longitudinal data in the model building process to enhance the predictive performance of RSF. To extract the essential features of the multivariate longitudinal outcomes, two methods were adopted and compared: multivariate functional principal component analysis and multivariate fast covariance estimation for sparse functional data. These resulting features, which capture the trajectories of the multiple longitudinal outcomes, are then included as time-independent predictors in the subsequent RSF model. This non-parametric modeling framework, denoted as functional survival forests, is better at capturing the various trends in both the longitudinal outcomes and the survival model which may be difficult to model using only parametric approaches. These advantages are demonstrated through simulations and applications to the Alzheimer's Disease Neuroimaging Initiative.
Collapse
Affiliation(s)
- Jeffrey Lin
- Department of Biostatistics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Kan Li
- Merck Research Laboratory, Merck & Co., North Wales, PA, USA
| | - Sheng Luo
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, USA
| |
Collapse
|
7
|
Wu C, Li L, Li R. Dynamic prediction of competing risk events using landmark sub-distribution hazard model with multiple longitudinal biomarkers. Stat Methods Med Res 2020; 29:3179-3191. [PMID: 32419611 PMCID: PMC10469606 DOI: 10.1177/0962280220921553] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The cause-specific cumulative incidence function quantifies the subject-specific disease risk with competing risk outcome. With longitudinally collected biomarker data, it is of interest to dynamically update the predicted cumulative incidence function by incorporating the most recent biomarker as well as the cumulating longitudinal history. Motivated by a longitudinal cohort study of chronic kidney disease, we propose a framework for dynamic prediction of end stage renal disease using multivariate longitudinal biomarkers, accounting for the competing risk of death. The proposed framework extends the local estimation-based landmark survival modeling to competing risks data, and implies that a distinct sub-distribution hazard regression model is defined at each biomarker measurement time. The model parameters, prediction horizon, longitudinal history and at-risk population are allowed to vary over the landmark time. When the measurement times of biomarkers are irregularly spaced, the predictor variable may not be observed at the time of prediction. Local polynomial is used to estimate the model parameters without explicitly imputing the predictor or modeling its longitudinal trajectory. The proposed model leads to simple interpretation of the regression coefficients and closed-form calculation of the predicted cumulative incidence function. The estimation and prediction can be implemented through standard statistical software with tractable computation. We conducted simulations to evaluate the performance of the estimation procedure and predictive accuracy. The methodology is illustrated with data from the African American Study of Kidney Disease and Hypertension.
Collapse
Affiliation(s)
- Cai Wu
- Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, USA
- Department of Biostatistics, University of Texas School of Public Health, Houston, USA
| | - Liang Li
- Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, USA
| | - Ruosha Li
- Department of Biostatistics, University of Texas School of Public Health, Houston, USA
| |
Collapse
|
8
|
Abudayyeh A, Lin H, Abdelrahim M, Rondon G, Andersson BS, Martinez CS, Page VD, Tarrand JJ, Kontoyiannis DP, Marin D, Oran B, Olson A, Jones R, Popat U, Champlin RE, Chemaly RF, Shpall EJ, Rezvani K. Development and validation of a risk assessment tool for BKPyV Replication in allogeneic stem cell transplant recipients. Transpl Infect Dis 2020; 22:e13395. [PMID: 32602954 DOI: 10.1111/tid.13395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 06/15/2020] [Accepted: 06/17/2020] [Indexed: 11/29/2022]
Abstract
BACKGROUND BK polymavirus (BKPyV), a member of the family Polyomaviridae, is associated with increased morbidity and mortality in allogeneic stem cell transplant recipients. METHODS In our previous retrospective study of 2477 stem cell transplant patients, BKPyV replication independently predicted chronic kidney disease and poor survival. In this study, using the same cohort, we derived and validated a risk grading system to identify patients at risk of BKPyV replication after transplantation in a user-friendly modality. We used 3 baseline variables (conditioning regimen, HLA match status, and underlying cancer diagnosis) that significantly predicted BKPyV replication in our initial study in a subdistribution hazard model with death as a competing risk. We also developed a nomogram of the hazard model as a visual aid. The AUC of the ROC of the risk-score-only model was 0.65. We further stratified the patients on the basis of risk score into low-, moderate-, and high-risk groups. RESULTS The total risk score was significantly associated with BKPyV replication (P < .0001). At 30 days after transplantation, the low-risk (score ≤ 0) patients had a 9% chance of developing symptomatic BKPyV replication, while the high-risk (score ≥ 8) of the population had 56% of developing BKPyV replication. We validated the risk score using a separate cohort of 1478 patients. The AUC of the ROC of the risk-score-only model was 0.59. Both the total risk score and 3-level risk variable were significantly associated with BKPyV replication in this cohort (P < .0001). CONCLUSIONS This grading system for the risk of symptomatic BKPyV replication may help in early monitoring and intervention to prevent BKPyV-associated morbidity, mortality, and kidney function decline.
Collapse
Affiliation(s)
- Ala Abudayyeh
- Division of Internal Medicine, Section of Nephrology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Heather Lin
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Maen Abdelrahim
- Institute of Academic Medicine and Weill Cornell Medical College, Houston Methodist Cancer Center, Houston, TX, USA
| | - Gabriela Rondon
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Borje S Andersson
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Charles S Martinez
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Valda D Page
- Department of Emergency Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Jeffrey J Tarrand
- Department of Laboratory Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Dimitrios P Kontoyiannis
- Department of Infectious Diseases, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - David Marin
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Betul Oran
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Amanda Olson
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Roy Jones
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Uday Popat
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Richard E Champlin
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Roy F Chemaly
- Department of Infectious Diseases, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Elizabeth J Shpall
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Katayoun Rezvani
- Department of Stem Cell Transplantation and Cellular Therapy, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| |
Collapse
|
9
|
Wang Z, Wang X. Evaluating the time-dependent predictive accuracy for event-to-time outcome with a cure fraction. Pharm Stat 2020; 19:955-974. [PMID: 32776646 DOI: 10.1002/pst.2048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Revised: 05/10/2020] [Accepted: 06/14/2020] [Indexed: 11/08/2022]
Abstract
In medical studies, it is often observed that a portion of subjects will never experience the event of interest and thus can be treated as cured or long-term survivors. Many populations of early-stage cancer patients contain both uncured and cured individuals that should be modeled using cure models. In prognostic studies, the cure status (uncure or cure) is an issue of interest for medical practitioners, and the disease status (death or alive) of an individual is not a fixed characteristic and it varies along the time. These statuses are usually predicted by a prognostic risk score. The time-dependent receiver operating characteristic (ROC) curve is a powerful tool to evaluate these predicting performances dynamically. In the context with a cure fraction, quantifying and estimating the predictive performances of the risk score is a challenge since the disease status and cure status are both unknown among individuals who are censored. In this paper, to assess the predictive accuracy for the survival outcome with a cure fraction, we propose a time-dependent ROC curve semi-parametric estimator based on the sieve maximum likelihood (ML) estimation under the mixture cure model. We also apply a Bernstein-based smoothing method in the estimation procedure, and this estimator can lead to substantial gain in efficiency. In addition, we derive the time-dependent area under the ROC curve (AUC) to summarize the discriminatory capacity of the risk score globally. Finally, we evaluate the finite sample performance of the proposed methods by extensive simulations and illustrate the estimation using two real data sets, one from a melanoma study and the other from stomach cancer.
Collapse
Affiliation(s)
- Ziwen Wang
- School of Mathematical Sciences, Dalian University of Technology, Dalian, China
| | - Xiaoguang Wang
- School of Mathematical Sciences, Dalian University of Technology, Dalian, China
| |
Collapse
|
10
|
Li L, Yang W, Astor BC, Greene T. Competing Risk Modeling: Time to Put it in Our Standard Analytical Toolbox. J Am Soc Nephrol 2019; 30:2284-2286. [PMID: 31732615 DOI: 10.1681/asn.2019101011] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Affiliation(s)
- Liang Li
- Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, Texas;
| | - Wei Yang
- Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Brad C Astor
- Department of Medicine, University of Wisconsin-Madison, Madison, Wisconsin; and
| | - Tom Greene
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah
| |
Collapse
|