1
|
Davies L, Hankey BF, Wang Z, Zou Z, Scott S, Lee M, Cho H, Feuer EJ. A New Personalized Oral Cancer Survival Calculator to Estimate Risk of Death From Both Oral Cancer and Other Causes. JAMA Otolaryngol Head Neck Surg 2023; 149:993-1000. [PMID: 37429022 PMCID: PMC10334297 DOI: 10.1001/jamaoto.2023.1975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 06/13/2023] [Indexed: 07/12/2023]
Abstract
Importance Standard cancer prognosis models typically do not include much specificity in characterizing competing illnesses or general health status when providing prognosis estimates, limiting their utility for individuals, who must consider their cancer in the context of their overall health. This is especially true for patients with oral cancer, who frequently have competing illnesses. Objective To describe a statistical framework and accompanying new publicly available calculator that provides personalized estimates of the probability of a patient surviving or dying from cancer or other causes, using oral cancer as the first data set. Design, Setting, and Participants The models used data from the Surveillance, Epidemiology, and End Results (SEER) 18 registry (2000 to 2011), SEER-Medicare linked files, and the National Health Interview Survey (NHIS) (1986 to 2009). Statistical methods developed to calculate natural life expectancy in the absence of the cancer, cancer-specific survival, and other-cause survival were applied to oral cancer data and internally validated with 10-fold cross-validation. Eligible participants were aged between 20 and 94 years with oral squamous cell carcinoma. Exposures Histologically confirmed oral cancer, general health status, smoking, and selected serious comorbid conditions. Main Outcomes and Measures Probabilities of surviving or dying from the cancer or from other causes, and life expectancy in the absence of the cancer. Results A total of 22 392 patients with oral squamous cell carcinoma (13 544 male [60.5%]; 1476 Asian and Pacific Islander [6.7%]; 1792 Black [8.0%], 1589 Hispanic [7.2%], 17 300 White [78.1%]) and 402 626 NHIS interviewees were included in this calculator designed for public use for patients ages 20 to 86 years with newly diagnosed oral cancer to obtain estimates of health status-adjusted age, life expectancy in the absence of the cancer, and the probability of surviving, dying from the cancer, or dying from other causes within 1 to 10 years after diagnosis. The models in the calculator estimated that patients with oral cancer have a higher risk of death from other causes than their matched US population, and that this risk increases by stage. Conclusions and relevance The models developed for the calculator demonstrate that survival estimates that exclude the effects of coexisting conditions can lead to underestimates or overestimates of survival. This new calculator approach will be broadly applicable for developing future prognostic models of cancer and noncancer aspects of a person's health in other cancers; as registries develop more linkages, available covariates will become broader, strengthening future tools.
Collapse
Affiliation(s)
- Louise Davies
- VA Outcomes Group, Department of Veterans Affairs Medical Center, White River Junction, Vermont
- Section of Otolaryngology in Geisel School of Medicine at Dartmouth, and The Dartmouth Institute for Health Policy and Clinical Practice, Lebanon, New Hampshire
| | - Benjamin F. Hankey
- Statistical Research and Application Branch, Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland
| | - Zhuoqiao Wang
- Information Management Services, Calverton, Maryland
| | - Zhaohui Zou
- Information Management Services, Calverton, Maryland
| | - Susan Scott
- Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland
| | - Minjung Lee
- Department of Statistics, Kangwon National University, Chuncheon, Gangwon, Korea
| | - Hyunsoon Cho
- Department of Cancer AI and Digital Health, National Cancer Center Graduate School of Cancer Science and Policy, and the Integrated Biostatistics Branch, Division of Cancer Data Science, National Cancer Center, Goyang, Gyeonggi-do, Korea
| | - Eric J. Feuer
- Statistical Research and Application Branch, Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland
| |
Collapse
|
2
|
Davies L, Hankey BF, Wang Z, Zou Z, Scott S, Lee M, Cho H, Feuer EJ. Key Points for Clinicians About the SEER Oral Cancer Survival Calculator. JAMA Otolaryngol Head Neck Surg 2023; 149:1042-1046. [PMID: 37429019 DOI: 10.1001/jamaoto.2023.1977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2023]
Abstract
Importance In the setting of a new cancer diagnosis, the focus is usually on the cancer as the main threat to survival, but people may have other conditions that pose an equal or greater threat to their life than their cancer: a competing risk of death. This is especially true for patients who have cancer of the oral cavity, because prolonged exposure to alcohol and tobacco are risk factors for cancer in this location but also can result in medical conditions with the potential to shorten life expectancy, competing as a cause of death that may intervene in conjunction with or before the cancer. Observations A calculator designed for public use has been released that allows patients age 20 to 86 years who have a newly diagnosed oral cancer to obtain estimates of their health status-adjusted age, life expectancy in the absence of the cancer, and probability of surviving, dying of the cancer, or dying of other causes within 1 to 10 years after diagnosis. The models in the calculator showed that patients with oral cavity cancer had a higher than average risk of death from other causes than the matched US population, and this risk increases by stage. Conclusions and Relevance The Surveillance, Epidemiology and End Results Program Oral Cancer Survival Calculator supports a holistic approach to the life of the patient, and the risk of death of other causes is treated equally to consideration of the probability of death of the cancer. This tool may be usefully paired with the other available prognostic calculators for oral cancer and is an example of the possibilities now available with registry linkages to partially overlapping or independent data sets and statistical techniques that allow the use of 2 time scales in 1 analysis.
Collapse
Affiliation(s)
- Louise Davies
- VA Outcomes Group, US Department of Veterans Affairs Medical Center, White River Junction, Vermont
- Section of Otolaryngology at the Geisel School of Medicine at Dartmouth, Dartmouth Institute for Health Policy and Clinical Practice, Lebanon, New Hampshire
| | - Benjamin F Hankey
- Statistical Research and Application Branch, Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland
| | - Zhuoqiao Wang
- Information Management Services, Inc, Calverton, Maryland
| | - Zhaohui Zou
- Information Management Services, Inc, Calverton, Maryland
| | - Susan Scott
- Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland
| | - Minjung Lee
- Department of Statistics, Kangwon National University, Chuncheon, Gangwon, Korea
| | - Hyunsoon Cho
- Department of Cancer AI and Digital Health, National Cancer Center Graduate School of Cancer Science and Policy and the Integrated Biostatistics Branch, Division of Cancer Data Science, National Cancer Center, Goyang, Gyeonggi-do, Korea
| | - Eric J Feuer
- Statistical Research and Application Branch, Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland
| |
Collapse
|
3
|
Wells M, Rutherford MJ, Lambert PC. Fair comparisons of cause-specific and relative survival by accounting for the systematic removal of patients from risk-sets. Cancer Epidemiol 2023; 86:102408. [PMID: 37591148 DOI: 10.1016/j.canep.2023.102408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 06/22/2023] [Accepted: 06/26/2023] [Indexed: 08/19/2023]
Abstract
BACKGROUND In population-based cancer studies it is common to try to isolate the impact of cancer by estimating net survival. Net survival is defined as the probability of surviving cancer in the absence of any other-causes of death. Net survival can be estimated either in the cause-specific or relative survival framework. Cause-specific survival considers deaths from the cancer as the event of interest. Relative survival incorporates general population expected mortality rates to represent the other-cause mortality rate. Estimation approaches in both frameworks are impacted by the systematic removal of patients from the risk-set, commonly referred to as informative censoring in the cause-specific framework. In the relative survival framework, the Pohar Perme estimator combats the effect of this systematic removal of patients through weighting. When the two frameworks have been compared, informative censoring is rarely accounted for in the cause-specific framework. METHODS We investigate the use of weighted cause-specific Kaplan-Meier estimates to overcome the impact of informative censoring and compared approaches to defining weights. Individuals remaining in the risk-set are upweighted using their predicted other-cause survival obtained through various model-based approaches. We also compare weights derived from expected mortality rates. We applied the approaches to US cancer registry data and conducted a simulation study. RESULTS Using weighted cause-specific estimates provides a better estimate of marginal net survival. The unweighted Kaplan-Meier estimates have a similar bias to the Ederer II method for relative survival. Weighted Kaplan-Meier estimates are unbiased and similar to the Pohar Perme estimator. There was little variation between the several weighting approaches. CONCLUSION In comparisons of cause-specific and relative survival, it is important to compare "like-with-like", therefore, a weighted approach should be considered for both frameworks. If researchers are interested in obtaining net measures in a cause-specific framework, then weighting is needed to account for informative censoring.
Collapse
Affiliation(s)
- Molly Wells
- Biostatistics Research Group, Department of Population Health Sciences, University of Leicester, University Road, LE1 7RH, Leicester, UK.
| | - Mark J Rutherford
- Biostatistics Research Group, Department of Population Health Sciences, University of Leicester, University Road, LE1 7RH, Leicester, UK
| | - Paul C Lambert
- Biostatistics Research Group, Department of Population Health Sciences, University of Leicester, University Road, LE1 7RH, Leicester, UK; Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, PO Box 281, 24105 Stockholm, Sweden
| |
Collapse
|
4
|
Sun T, Ding Y. Neural network on interval-censored data with application to the prediction of Alzheimer's disease. Biometrics 2023; 79:2677-2690. [PMID: 35960189 PMCID: PMC10177011 DOI: 10.1111/biom.13734] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2021] [Accepted: 08/01/2022] [Indexed: 11/28/2022]
Abstract
Alzheimer's disease (AD) is a progressive and polygenic disorder that affects millions of individuals each year. Given that there have been few effective treatments yet for AD, it is highly desirable to develop an accurate model to predict the full disease progression profile based on an individual's genetic characteristics for early prevention and clinical management. This work uses data composed of all four phases of the Alzheimer's Disease Neuroimaging Initiative (ADNI) study, including 1740 individuals with 8 million genetic variants. We tackle several challenges in this data, characterized by large-scale genetic data, interval-censored outcome due to intermittent assessments, and left truncation in one study phase (ADNIGO). Specifically, we first develop a semiparametric transformation model on interval-censored and left-truncated data and estimate parameters through a sieve approach. Then we propose a computationally efficient generalized score test to identify variants associated with AD progression. Next, we implement a novel neural network on interval-censored data (NN-IC) to construct a prediction model using top variants identified from the genome-wide test. Comprehensive simulation studies show that the NN-IC outperforms several existing methods in terms of prediction accuracy. Finally, we apply the NN-IC to the full ADNI data and successfully identify subgroups with differential progression risk profiles. Data used in the preparation of this article were obtained from the ADNI database.
Collapse
Affiliation(s)
- Tao Sun
- Center for Applied Statistics, School of Statistics, Renmin University of China, Beijing, China
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Ying Ding
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
5
|
Vinh-Hung V, Gorobets O, Natchagande G, Sargos P, Yin M, Nguyen NP, Verschraegen C, Folefac E. Low-Dose Enzalutamide in Metastatic Prostate Cancer-Longevity Over Conventional Survival Analysis. Clin Genitourin Cancer 2022; 20:e473-e484. [PMID: 35778336 DOI: 10.1016/j.clgc.2022.05.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 01/08/2022] [Accepted: 05/30/2022] [Indexed: 01/10/2023]
Abstract
BACKGROUND Enzalutamide is an important drug in the treatment of prostate cancer. Standard dosing often requires dose reduction because of side effects. There is no information on survival outcomes with lower doses. We investigated the impact of starting enzalutamide at ≤ 50% dose on metastatic prostate cancer outcomes including patients' longevity. PATIENTS AND METHODS Records of metastatic prostate cancer patients treated with enzalutamide at one center were retrospectively reviewed. Low-dose enzalutamide (≤80 mg/day) was compared with standard-dose (160 mg/day). The primary objective was to compute the restricted mean survival time (RMST - time scale) and restricted mean attained age (RMAA - age scale) using the Irwin method. Secondary objectives included overall survival (OS), progression-free survival (PFS), and PSA progression per PCWG3 criteria (PSA PFS). We used the logrank test and the ∆ difference between RMSTs for comparison. RESULTS Of 111 patients treated, 32 received a low-dose and 79 the standard-dose. Low-dose patients had less prior abiraterone or chemotherapy (28.1% vs. 65.8%, P < .001); more testosterone assessment (65.6% vs. 40.5%, P = .016); poorer ECOG performance status (48.3% score ≥2 vs. 26.6%; P = .040), more comorbidities (75.9% vs. 46.3%; P = .010)) including increased cardiovascular disease (51.7% vs. 21.4%, P = .004). Baseline PSA value and doubling time at start of enzalutamide and distribution of metastases were similar between the groups. OS and PFS did not differ between low-dose and standard-dose. Patients on low-dose had a better longevity with significantly longer RMAA, 89.1 years, versus standard-dose RMAA of 83.8 years (∆ = 5.3 years, P = .003, logrank P = .025). In a subgroup analysis by age at start of enzalutamide, <75 versus ≥75 years old, longevity was also better with low-dose in younger patients (∆ = 2.9 years, P = .034, and older, ∆ = 3.3 years, P = .011). CONCLUSION The longevity advantage and reduced adverse events seen in patients with prostate cancer treated with low-dose enzalutamide warrants further investigation.
Collapse
Affiliation(s)
- Vincent Vinh-Hung
- Centre Hospitalier Universitaire de Martinique, Fort-de-France, Martinique, France
| | | | - Gilles Natchagande
- Centre National Hospitalier Universitaire Hubert K. MAGA, Cotonou, Benin
| | - Paul Sargos
- Département de radiothérapie, Institut Bergonié, Bordeaux, France
| | - Ming Yin
- Ohio State University Comprehensive Cancer Center, Columbus, OH
| | | | | | - Edmund Folefac
- Ohio State University Comprehensive Cancer Center, Columbus, OH
| |
Collapse
|
6
|
Skourlis N, Crowther MJ, Andersson TM, Lambert PC. On the choice of timescale for other cause mortality in a competing risk setting using flexible parametric survival models. Biom J 2022; 64:1161-1177. [PMID: 35708221 PMCID: PMC9795972 DOI: 10.1002/bimj.202100254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 02/13/2022] [Accepted: 04/24/2022] [Indexed: 12/30/2022]
Abstract
In competing risks settings where the events are death due to cancer and death due to other causes, it is common practice to use time since diagnosis as the timescale for all competing events. However, attained age has been proposed as a more natural choice of timescale for modeling other cause mortality. We examine the choice of using time since diagnosis versus attained age as the timescale when modeling other cause mortality, assuming that the hazard rate is a function of attained age, and how this choice can influence the cumulative incidence functions ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"> <mml:semantics><mml:mrow><mml:mi>C</mml:mi> <mml:mi>I</mml:mi> <mml:mi>F</mml:mi></mml:mrow> <mml:annotation>$CIF$</mml:annotation></mml:semantics> </mml:math> s) derived using flexible parametric survival models. An initial analysis on the colon cancer data from the population-based Swedish Cancer Register indicates such an influence. A simulation study is conducted in order to assess the impact of the choice of timescale for other cause mortality on the bias of the estimated <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"> <mml:semantics><mml:mrow><mml:mi>C</mml:mi> <mml:mi>I</mml:mi> <mml:mi>F</mml:mi> <mml:mi>s</mml:mi></mml:mrow> <mml:annotation>$CIFs$</mml:annotation></mml:semantics> </mml:math> and how different factors may influence the bias. We also use regression standardization methods in order to obtain marginal <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"> <mml:semantics><mml:mrow><mml:mi>C</mml:mi> <mml:mi>I</mml:mi> <mml:mi>F</mml:mi></mml:mrow> <mml:annotation>$CIF$</mml:annotation></mml:semantics> </mml:math> estimates. Using time since diagnosis as the timescale for all competing events leads to a low degree of bias in <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"> <mml:semantics><mml:mrow><mml:mi>C</mml:mi> <mml:mi>I</mml:mi> <mml:mi>F</mml:mi></mml:mrow> <mml:annotation>$CIF$</mml:annotation></mml:semantics> </mml:math> for cancer mortality ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"> <mml:semantics><mml:mrow><mml:mi>C</mml:mi> <mml:mi>I</mml:mi> <mml:msub><mml:mi>F</mml:mi> <mml:mn>1</mml:mn></mml:msub> </mml:mrow> <mml:annotation>$CIF_{1}$</mml:annotation></mml:semantics> </mml:math> ) under all approaches. It also leads to a low degree of bias in <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"> <mml:semantics><mml:mrow><mml:mi>C</mml:mi> <mml:mi>I</mml:mi> <mml:mi>F</mml:mi></mml:mrow> <mml:annotation>$CIF$</mml:annotation></mml:semantics> </mml:math> for other cause mortality ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"> <mml:semantics><mml:mrow><mml:mi>C</mml:mi> <mml:mi>I</mml:mi> <mml:msub><mml:mi>F</mml:mi> <mml:mn>2</mml:mn></mml:msub> </mml:mrow> <mml:annotation>$CIF_{2}$</mml:annotation></mml:semantics> </mml:math> ), provided that the effect of age at diagnosis is included in the model with sufficient flexibility, with higher bias under scenarios where a covariate has a time-varying effect on the hazard rate for other cause mortality on the attained age scale.
Collapse
Affiliation(s)
- Nikolaos Skourlis
- Department of Medical Epidemiology and BiostatisticsKarolinska InstitutetNobels VägStockholmSweden
| | - Michael J. Crowther
- Department of Medical Epidemiology and BiostatisticsKarolinska InstitutetNobels VägStockholmSweden
| | - Therese M.‐L. Andersson
- Department of Medical Epidemiology and BiostatisticsKarolinska InstitutetNobels VägStockholmSweden
| | - Paul C. Lambert
- Department of Medical Epidemiology and BiostatisticsKarolinska InstitutetNobels VägStockholmSweden,Biostatistics Research GroupDepartment of Health SciencesUniversity of LeicesterUniversity RoadLeicesterUK
| |
Collapse
|
7
|
Lee M, Fine JP. Competing risks predictions with different time scales under the additive risk model. Stat Med 2022; 41:3941-3957. [PMID: 35670574 DOI: 10.1002/sim.9485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 05/14/2022] [Accepted: 05/17/2022] [Indexed: 11/05/2022]
Abstract
In the analysis for competing risks data, regression modeling of the cause-specific hazard functions has been usually conducted using the same time scale for all event types. However, when the true time scale is different for each event type, it would be appropriate to specify regression models for the cause-specific hazards on different time scales for different event types. Often, the proportional hazards model has been used for regression modeling of the cause-specific hazard functions. However, the proportionality assumption may not be appropriate in practice. In this article, we consider the additive risk model as an alternative to the proportional hazards model. We propose predictions of the cumulative incidence functions under the cause-specific additive risk models employing different time scales for different event types. We establish the consistency and asymptotic normality of the predicted cumulative incidence functions under the cause-specific additive risk models specified on different time scales using empirical processes and derive consistent variance estimators of the predicted cumulative incidence functions. Through simulation studies, we show that the proposed prediction methods perform well. We illustrate the methods using stage III breast cancer data obtained from the Surveillance, Epidemiology, and End Results (SEER) program of the National Cancer Institute.
Collapse
Affiliation(s)
- Minjung Lee
- Department of Statistics, Kangwon National University, Chuncheon, Gangwon, Korea
| | - Jason P Fine
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| |
Collapse
|
8
|
Hiabu M, Nielsen JP, Scheike TH. Nonsmooth backfitting for the excess risk additive regression model with two survival time scales. Biometrika 2021. [DOI: 10.1093/biomet/asaa058] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Summary
We consider an extension of Aalen’s additive regression model that allows covariates to have effects that vary on two different time scales. The two time scales considered are equal up to a constant for each individual and vary across individuals, such as follow-up time and age in medical studies or calendar time and age in longitudinal studies. The model was introduced in Scheike (2001), where it was solved using smoothing techniques. We present a new backfitting algorithm for estimating the structured model without having to use smoothing. Estimators of the cumulative regression functions on the two time scales are suggested by solving local estimating equations jointly on the two time scales. We provide large-sample properties and simultaneous confidence bands. The model is applied to data on myocardial infarction, providing a separation of the two effects stemming from time since diagnosis and age.
Collapse
Affiliation(s)
- M Hiabu
- School of Mathematics and Statistics, University of Sydney, Camperdown, New South Wales 2006, Australia
| | - J P Nielsen
- Cass Business School, City, University of London, 106 Bunhill Row, London EC1Y 8TZ, U.K
| | - T H Scheike
- Section of Biostatistics, Department of Public Health, University of Copenhagen, Øster Farimagsgade 5B, 1014 Copenhagen K, Denmark
| |
Collapse
|
9
|
Weibull CE, Lambert PC, Eloranta S, Andersson TML, Dickman PW, Crowther MJ. A multistate model incorporating estimation of excess hazards and multiple time scales. Stat Med 2021; 40:2139-2154. [PMID: 33556998 DOI: 10.1002/sim.8894] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 12/26/2020] [Accepted: 01/12/2021] [Indexed: 01/08/2023]
Abstract
As cancer patient survival improves, late effects from treatment are becoming the next clinical challenge. Chemotherapy and radiotherapy, for example, potentially increase the risk of both morbidity and mortality from second malignancies and cardiovascular disease. To provide clinically relevant population-level measures of late effects, it is of importance to (1) simultaneously estimate the risks of both morbidity and mortality, (2) partition these risks into the component expected in the absence of cancer and the component due to the cancer and its treatment, and (3) incorporate the multiple time scales of attained age, calendar time, and time since diagnosis. Multistate models provide a framework for simultaneously studying morbidity and mortality, but do not solve the problem of partitioning the risks. However, this partitioning can be achieved by applying a relative survival framework, allowing us to directly quantify the excess risk. This article proposes a combination of these two frameworks, providing one approach to address (1) to (3). Using recently developed methods in multistate modeling, we incorporate estimation of excess hazards into a multistate model. Both intermediate and absorbing state risks can be partitioned and different transitions are allowed to have different and/or multiple time scales. We illustrate our approach using data on Hodgkin lymphoma patients and excess risk of diseases of the circulatory system, and provide user-friendly Stata software with accompanying example code.
Collapse
Affiliation(s)
- Caroline E Weibull
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Clinical Epidemiology Division, Department of Medicine Solna, Karolinska Institutet, Stockholm, Sweden
| | - Paul C Lambert
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Biostatistics Research Group, Department of Health Sciences, University of Leicester, Leicester, UK
| | - Sandra Eloranta
- Clinical Epidemiology Division, Department of Medicine Solna, Karolinska Institutet, Stockholm, Sweden
| | - Therese M L Andersson
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Paul W Dickman
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Michael J Crowther
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Biostatistics Research Group, Department of Health Sciences, University of Leicester, Leicester, UK
| |
Collapse
|
10
|
Lee M, Feuer EJ, Wang Z, Cho H, Zou Z, Hankey BF, Mariotto AB, Fine JP. Analyzing discrete competing risks data with partially overlapping or independent data sources and nonstandard sampling schemes, with application to cancer registries. Stat Med 2019; 38:5528-5546. [PMID: 31657494 DOI: 10.1002/sim.8381] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2018] [Revised: 08/19/2019] [Accepted: 09/07/2019] [Indexed: 11/11/2022]
Abstract
This paper demonstrates the flexibility of a general approach for the analysis of discrete time competing risks data that can accommodate complex data structures, different time scales for different causes, and nonstandard sampling schemes. The data may involve a single data source where all individuals contribute to analyses of both cause-specific hazard functions, overlapping datasets where some individuals contribute to the analysis of the cause-specific hazard function of only one cause while other individuals contribute to analyses of both cause-specific hazard functions, or separate data sources where each individual contributes to the analysis of the cause-specific hazard function of only a single cause. The approach is modularized into estimation and prediction. For the estimation step, the parameters and the variance-covariance matrix can be estimated using widely available software. The prediction step utilizes a generic program with plug-in estimates from the estimation step. The approach is illustrated with three prognostic models for stage IV male oral cancer using different data structures. The first model uses only men with stage IV oral cancer from population-based registry data. The second model strategically extends the cohort to improve the efficiency of the estimates. The third model improves the accuracy for those with a lower risk of other causes of death, by bringing in an independent data source collected under a complex sampling design with additional other-cause covariates. These analyses represent novel extensions of existing methodology, broadly applicable for the development of prognostic models capturing both the cancer and noncancer aspects of a patient's health.
Collapse
Affiliation(s)
- Minjung Lee
- Department of Statistics, Kangwon National University, Chuncheon, Gangwon, South Korea
| | - Eric J Feuer
- Statistical Research and Applications Branch, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland
| | - Zhuoqiao Wang
- Information Management Services, Inc, Calverton, Maryland
| | - Hyunsoon Cho
- Department of Cancer Control and Population Health, Graduate School of Cancer Science and Policy, National Cancer Center, Goyang, Gyeonggi-do, South Korea.,Division of Cancer Registration and Surveillance, National Cancer Center, Goyang, Gyeonggi-do, South Korea
| | - Zhaohui Zou
- Information Management Services, Inc, Calverton, Maryland
| | | | - Angela B Mariotto
- Data Analytics Branch, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland
| | - Jason P Fine
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina.,Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| |
Collapse
|
11
|
Boschini C, Andersen KK, Scheike TH. Excess risk estimation for matched cohort survival data. Stat Methods Med Res 2018; 28:3451-3465. [PMID: 30343631 DOI: 10.1177/0962280218804269] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
We present an excess risk regression model for matched cohort data, where the occurrence of some events for individuals with a disease is compared to that of healthy controls that are matched at the onset-of-disease by various factors. By using the matched structure, we show how to estimate the excess risk and its dependence on covariates on both proportional and additive form. We remove the individual effects on background mortality related to matching factors by considering differences. The model handles two different time scales, namely attained age and follow-up time. First, we solve estimating equations for the non-parametric and parametric components of the excess risk model, providing large sample properties for the suggested estimators. Next, we report results from a simulation study. Lastly, we describe an application of the method on childhood cancer data, to study the excess risk of cardiovascular events in adults' life among childhood cancer survivors.
Collapse
Affiliation(s)
- Cristina Boschini
- Unit of Statistics and Pharmacoepidemiology, Danish Cancer Society Research Center, Copenhagen Ø, Denmark.,Section of Biostatistics, Department of Public Health, University of Copenhagen, Copenhagen K, Denmark
| | - Klaus K Andersen
- Unit of Statistics and Pharmacoepidemiology, Danish Cancer Society Research Center, Copenhagen Ø, Denmark
| | - Thomas H Scheike
- Section of Biostatistics, Department of Public Health, University of Copenhagen, Copenhagen K, Denmark
| |
Collapse
|
12
|
Lee M, Feuer EJ, Fine JP. On the analysis of discrete time competing risks data. Biometrics 2018; 74:1468-1481. [DOI: 10.1111/biom.12881] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 03/01/2018] [Accepted: 03/01/2018] [Indexed: 11/29/2022]
Affiliation(s)
- Minjung Lee
- Department of StatisticsKangwon National UniversityChuncheonGangwon 24341South Korea
| | - Eric J. Feuer
- Statistical Research and Applications BranchDivision of Cancer Control and Population StudiesNational Cancer InstituteBethesdaMaryland 20892U.S.A
| | - Jason P. Fine
- Department of BiostatisticsUniversity of North Carolina at Chapel Hill Chapel Hill North Carolina 27599 U.S.A
- Department of StatisticsUniversity of North Carolina at Chapel Hill Chapel Hill North Carolina 27599 U.S.A
| |
Collapse
|