1
|
Ma C, Wang C, Pan J. Multistate modeling and structure selection for multitype recurrent events and terminal event data. Biom J 2023; 65:e2100334. [PMID: 36124712 DOI: 10.1002/bimj.202100334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 06/19/2022] [Accepted: 07/04/2022] [Indexed: 11/07/2022]
Abstract
In cardiovascular disease studies, a large number of risk factors are measured but it often remains unknown whether all of them are relevant variables and whether the impact of these variables is changing with time or remains constant. In addition, more than one kind of cardiovascular disease events can be observed in the same patient and events of different types are possibly correlated. It is expected that different kinds of events are associated with different covariates and the forms of covariate effects also vary between event types. To tackle these problems, we proposed a multistate modeling framework for the joint analysis of multitype recurrent events and terminal event. Model structure selection is performed to identify covariates with time-varying coefficients, time-independent coefficients, and null effects. This helps in understanding the disease process as it can detect relevant covariates and identify the temporal dynamics of the covariate effects. It also provides a more parsimonious model to achieve better risk prediction. The performance of the proposed model and selection method is evaluated in numerical studies and illustrated on a real dataset from the Atherosclerosis Risk in Communities study.
Collapse
Affiliation(s)
- Chuoxin Ma
- Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, China
| | - Chunyu Wang
- Department of Mathematics, University of Manchester, Manchester, UK
| | - Jianxin Pan
- Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, China.,Research Center for Mathematics, Beijing Normal University at Zhuhai, Zhuhai, China
| |
Collapse
|
2
|
Lee M, Zeleniuch-Jacquotte A, Liu M. Empirical evaluation of sub-cohort sampling designs for risk prediction modeling. J Appl Stat 2020; 48:1374-1401. [DOI: 10.1080/02664763.2020.1861225] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Affiliation(s)
- Myeonggyun Lee
- Department of Population Health, NYU School of Medicine, New York, NY, USA
| | - Anne Zeleniuch-Jacquotte
- Department of Population Health, NYU School of Medicine, New York, NY, USA
- Department of Environmental Medicine, NYU School of Medicine, New York, NY, USA
| | - Mengling Liu
- Department of Population Health, NYU School of Medicine, New York, NY, USA
- Department of Environmental Medicine, NYU School of Medicine, New York, NY, USA
| |
Collapse
|
3
|
Jin P, Zeleniuch-Jacquotte A, Liu M. Generalized mean residual life models for case-cohort and nested case-control studies. LIFETIME DATA ANALYSIS 2020; 26:789-819. [PMID: 32529421 PMCID: PMC7487008 DOI: 10.1007/s10985-020-09499-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2019] [Accepted: 05/25/2020] [Indexed: 06/11/2023]
Abstract
Mean residual life (MRL) is the remaining life expectancy of a subject who has survived to a certain time point and can be used as an alternative to hazard function for characterizing the distribution of a time-to-event variable. Inference and application of MRL models have primarily focused on full-cohort studies. In practice, case-cohort and nested case-control designs have been commonly used within large cohorts that have long follow-up and study rare diseases, particularly when studying costly molecular biomarkers. They enable prospective inference as the full-cohort design with significant cost-saving benefits. In this paper, we study the modeling and inference of a family of generalized MRL models under case-cohort and nested case-control designs. Built upon the idea of inverse selection probability, the weighted estimating equations are constructed to estimate regression parameters and baseline MRL function. Asymptotic properties of the proposed estimators are established and finite-sample performance is evaluated by extensive numerical simulations. An application to the New York University Women's Health Study is presented to illustrate the proposed models and demonstrate a model diagnostic method to guide practical implementation.
Collapse
Affiliation(s)
- Peng Jin
- Department of Population Health, New York University School of Medicine, New York, NY, 10016, USA
| | - Anne Zeleniuch-Jacquotte
- Department of Population Health, New York University School of Medicine, New York, NY, 10016, USA
- Department of Environmental Health, New York University School of Medicine, New York, NY, 10016, USA
| | - Mengling Liu
- Department of Population Health, New York University School of Medicine, New York, NY, 10016, USA.
- Department of Environmental Health, New York University School of Medicine, New York, NY, 10016, USA.
| |
Collapse
|
4
|
Zhang J, Chen L, Ye Y, Guo G, Chen R, Vanasse A, Wang S. Survival neural networks for time-to-event prediction in longitudinal study. Knowl Inf Syst 2020. [DOI: 10.1007/s10115-020-01472-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
5
|
Wang JH, Pan CH, Chang IS, Hsiung CA. Penalized full likelihood approach to variable selection for Cox's regression model under nested case-control sampling. LIFETIME DATA ANALYSIS 2020; 26:292-314. [PMID: 31065967 DOI: 10.1007/s10985-019-09475-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Accepted: 04/26/2019] [Indexed: 06/09/2023]
Abstract
Assuming Cox's regression model, we consider penalized full likelihood approach to conduct variable selection under nested case-control (NCC) sampling. Penalized non-parametric maximum likelihood estimates (PNPMLEs) are characterized by self-consistency equations derived from score functions. A cross-validation method based on profile likelihood is used to choose the tuning parameter within a family of penalty functions. Simulation studies indicate that the numerical performance of (P)NPMLE is better than weighted partial likelihood in estimating the log-relative risk and in identifying the covariates and the model, under NCC sampling. LASSO performs best when cohort size is small; SCAD performs best when cohort size is large and may eventually perform as well as the oracle estimator. Using the SCAD penalty, we establish the consistency, asymptotic normality, and oracle properties of the PNPMLE, as well as the sparsity property of the penalty. We also propose a consistent estimate of the asymptotic variance using observed profile likelihood. Our method is illustrated to analyze the diagnosis of liver cancer among those in a type 2 diabetic mellitus dataset who were treated with thiazolidinediones in Taiwan.
Collapse
Affiliation(s)
- Jie-Huei Wang
- Division of Biostatistics and Bioinformatics, Institute of Population Health Science, National Health Research Institutes, 35, Keyan Rd., Zhunan Town, Miaoli County, 35053, Taiwan
- Institute of Statistical Science, Academia Sinica, 128, Academia Rd., Section 2, Nankang, Taipei, 11529, Taiwan
| | - Chun-Hao Pan
- Institute of Statistical Science, Academia Sinica, 128, Academia Rd., Section 2, Nankang, Taipei, 11529, Taiwan
| | - I-Shou Chang
- Division of Biostatistics and Bioinformatics, Institute of Population Health Science, National Health Research Institutes, 35, Keyan Rd., Zhunan Town, Miaoli County, 35053, Taiwan.
- National Institute of Cancer Research, National Health Research Institutes, 35, Keyan Rd., Zhunan Town, Miaoli County, 35053, Taiwan.
| | - Chao Agnes Hsiung
- Division of Biostatistics and Bioinformatics, Institute of Population Health Science, National Health Research Institutes, 35, Keyan Rd., Zhunan Town, Miaoli County, 35053, Taiwan
| |
Collapse
|
6
|
Abstract
OBJECTIVE This study analyzed the risk factors for their possible association with overall survival and progression-free survival in cervical cancer, with a flexible model that allowed time-varying effects. METHODS Information about patients with cervical cancer from 2002 to 2012 was collected in the Kaohsiung Veterans General Hospital. All available biological and clinicopathologic factors were tested for the assumption of the Cox proportional hazard model, that is, whether they had time-varying effect on survival. The factors were also analyzed in univariate and multivariate statistics to identify independent risk factors. The multivariate analysis was performed with an extended Cox model so that those factors that failed the assumption test were allowed to vary with time. RESULTS Approximately 797 patients were included in the final analysis. Most factors tested passed the Cox assumption test, except tumor size and body mass index in the event of recurrence and preoperative CA125 values in the event of death (P < 0.05). Univariate and multivariate analysis identified tumor size, stage, and lymph nodal metastasis as independent significant risk factors for both recurrence and death (P < 0.05), with tumor size being a time-varying factor for recurrence. CONCLUSIONS Patients with larger tumor size, higher FIGO stage, and lymph nodal metastasis are faced with higher risk of recurrence and death. A larger tumor size poses increasingly higher risk for recurrence initially, and its importance declines as the patient survives longer without disease progression. These findings may be helpful to gynecologists when assessing tumor risk of patients with cervical cancer and in patient consultation.
Collapse
|
7
|
Kim G, Kim Y, Choi T. Bayesian Analysis of the Proportional Hazards Model with Time‐Varying Coefficients. Scand Stat Theory Appl 2017. [DOI: 10.1111/sjos.12263] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Gwangsu Kim
- Department of Statistics Seoul National University
- Data Science for Knowledge Creation Research Center Seoul National University
| | - Yongdai Kim
- Department of Statistics Seoul National University
| | | |
Collapse
|
8
|
Xiao W, Lu W, Zhang HH. JOINT STRUCTURE SELECTION AND ESTIMATION IN THE TIME-VARYING COEFFICIENT COX MODEL. Stat Sin 2016; 26:547-567. [PMID: 27540275 DOI: 10.5705/ss.2013.076] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Time-varying coefficient Cox model has been widely studied and popularly used in survival data analysis due to its flexibility for modeling covariate effects. It is of great practical interest to accurately identify the structure of covariate effects in a time-varying coefficient Cox model, i.e. covariates with null effect, constant effect and truly time-varying effect, and estimate the corresponding regression coefficients. Combining the ideas of local polynomial smoothing and group nonnegative garrote, we develop a new penalization approach to achieve such goals. Our method is able to identify the underlying true model structure with probability tending to one and simultaneously estimate the time-varying coefficients consistently. The asymptotic normalities of the resulting estimators are also established. We demonstrate the performance of our method using simulations and an application to the primary biliary cirrhosis data.
Collapse
Affiliation(s)
- Wei Xiao
- North Carolina State University and University of Arizona
| | - Wenbin Lu
- North Carolina State University and University of Arizona
| | | |
Collapse
|
9
|
Rivera CL, Lumley T. Using the entire history in the analysis of nested case cohort samples. Stat Med 2016; 35:3213-28. [DOI: 10.1002/sim.6917] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Revised: 01/20/2016] [Accepted: 01/31/2016] [Indexed: 11/10/2022]
Affiliation(s)
- C. L. Rivera
- Department of Biostatistics; Harvard School of Public Health; 677 Huntington Avenue, Kresge 803B Boston MA 02115 U.S.A
| | - T. Lumley
- Department of Biostatistics; Harvard School of Public Health; 677 Huntington Avenue, Kresge 803B Boston MA 02115 U.S.A
| |
Collapse
|
10
|
Zhou QM, Zheng Y, Chibnik LB, Karlson EW, Cai T. Assessing incremental value of biomarkers with multi-phase nested case-control studies. Biometrics 2015. [PMID: 26195245 DOI: 10.1111/biom.12344] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Accurate risk prediction models are needed to identify different risk groups for individualized prevention and treatment strategies. In the Nurses' Health Study, to examine the effects of several biomarkers and genetic markers on the risk of rheumatoid arthritis (RA), a three-phase nested case-control (NCC) design was conducted, in which two sequential NCC subcohorts were formed with one nested within the other, and one set of new markers measured on each of the subcohorts. One objective of the study is to evaluate clinical values of novel biomarkers in improving upon existing risk models because of potential cost associated with assaying biomarkers. In this paper, we develop robust statistical procedures for constructing risk prediction models for RA and estimating the incremental value (IncV) of new markers based on three-phase NCC studies. Our method also takes into account possible time-varying effects of biomarkers in risk modeling, which allows us to more robustly assess the biomarker utility and address the question of whether a marker is better suited for short-term or long-term risk prediction. The proposed procedures are shown to perform well in finite samples via simulation studies.
Collapse
Affiliation(s)
- Qian M Zhou
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, Canada, V5A1S6
| | - Yingye Zheng
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Lori B Chibnik
- Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | | | - Tianxi Cai
- Department of Biostatistics, Harvard University, Boston, MA, USA
| |
Collapse
|
11
|
Chang C, Chiang AJ, Wang HC, Chen WA, Chen J. Evaluation of the Time-Varying Effect of Prognostic Factors on Survival in Ovarian Cancer. Ann Surg Oncol 2015; 22:3976-80. [DOI: 10.1245/s10434-015-4493-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2014] [Indexed: 11/18/2022]
|