1
|
Lee M, Troxel AB, Liu M. Partial-linear single-index transformation models with censored data. LIFETIME DATA ANALYSIS 2024:10.1007/s10985-024-09624-z. [PMID: 38625444 DOI: 10.1007/s10985-024-09624-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 03/26/2024] [Indexed: 04/17/2024]
Abstract
In studies with time-to-event outcomes, multiple, inter-correlated, and time-varying covariates are commonly observed. It is of great interest to model their joint effects by allowing a flexible functional form and to delineate their relative contributions to survival risk. A class of semiparametric transformation (ST) models offers flexible specifications of the intensity function and can be a general framework to accommodate nonlinear covariate effects. In this paper, we propose a partial-linear single-index (PLSI) transformation model that reduces the dimensionality of multiple covariates into a single index and provides interpretable estimates of the covariate effects. We develop an iterative algorithm using the regression spline technique to model the nonparametric single-index function for possibly nonlinear joint effects, followed by nonparametric maximum likelihood estimation. We also propose a nonparametric testing procedure to formally examine the linearity of covariate effects. We conduct Monte Carlo simulation studies to compare the PLSI transformation model with the standard ST model and apply it to NYU Langone Health de-identified electronic health record data on COVID-19 hospitalized patients' mortality and a Veteran's Administration lung cancer trial.
Collapse
Affiliation(s)
- Myeonggyun Lee
- Division of Biostatistics, Department of Population Health, New York University Grossman School of Medicine, New York, NY, 10016, USA.
| | - Andrea B Troxel
- Division of Biostatistics, Department of Population Health, New York University Grossman School of Medicine, New York, NY, 10016, USA
| | - Mengling Liu
- Division of Biostatistics, Department of Population Health, New York University Grossman School of Medicine, New York, NY, 10016, USA
| |
Collapse
|
2
|
Devasia TP, Tsodikov A. Efficiency of the Breslow estimator in semiparametric transformation models. LIFETIME DATA ANALYSIS 2024; 30:291-309. [PMID: 38007694 PMCID: PMC11237962 DOI: 10.1007/s10985-023-09611-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 10/04/2023] [Indexed: 11/28/2023]
Abstract
Semiparametric transformation models for failure time data consist of a parametric regression component and an unspecified cumulative baseline hazard. The nonparametric maximum likelihood estimator (NPMLE) of the cumulative baseline hazard can be summarized in terms of weights introduced into a Breslow-type estimator (Weighted Breslow). At any given time point, the weights invoke an integral over the future of the cumulative baseline hazard, which presents theoretical and computational challenges. A simpler non-MLE Breslow-type estimator (Breslow) was derived earlier from a martingale estimating equation (MEE) setting observed and expected counts of failures equal, conditional on the past history. Despite much successful theoretical and computational development, the simpler Breslow estimator continues to be commonly used as a compromise between simplicity and perceived loss of full efficiency. In this paper we derive the relative efficiency of the Breslow estimator and consider the properties of the two estimators using simulations and real data on prostate cancer survival.
Collapse
Affiliation(s)
- Theresa P. Devasia
- Data Analytics Branch, Division of Cancer Control and Population Sciences, National Cancer Institute, 9609 Medical Center Drive, Rockville, MD 20850, USA
| | - Alexander Tsodikov
- Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, MI 48105, USA
| |
Collapse
|
3
|
Wang J, Zeng D, Lin DY. Fitting the Cox proportional hazards model to big data. Biometrics 2024; 80:ujae018. [PMID: 38497824 PMCID: PMC10946235 DOI: 10.1093/biomtc/ujae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 01/14/2024] [Accepted: 02/20/2024] [Indexed: 03/19/2024]
Abstract
The semiparametric Cox proportional hazards model, together with the partial likelihood principle, has been widely used to study the effects of potentially time-dependent covariates on a possibly censored event time. We propose a computationally efficient method for fitting the Cox model to big data involving millions of study subjects. Specifically, we perform maximum partial likelihood estimation on a small subset of the whole data and improve the initial estimator by incorporating the remaining data through one-step estimation with estimated efficient score functions. We show that the final estimator has the same asymptotic distribution as the conventional maximum partial likelihood estimator using the whole dataset but requires only a small fraction of computation time. We demonstrate the usefulness of the proposed method through extensive simulation studies and an application to the UK Biobank data.
Collapse
Affiliation(s)
- Jianqiao Wang
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
| | - Dan-Yu Lin
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
| |
Collapse
|
4
|
Ho YL, Hong JS, Huang YT. Model-based hypothesis tests for the causal mediation of semi-competing risks. LIFETIME DATA ANALYSIS 2024; 30:119-142. [PMID: 36949266 DOI: 10.1007/s10985-023-09595-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Accepted: 02/26/2023] [Indexed: 06/18/2023]
Abstract
Analyzing the causal mediation of semi-competing risks has become important in medical research. Semi-competing risks refers to a scenario wherein an intermediate event may be censored by a primary event but not vice versa. Causal mediation analyses decompose the effect of an exposure on the primary outcome into an indirect (mediation) effect: an effect mediated through a mediator, and a direct effect: an effect not through the mediator. Here we proposed a model-based testing procedure to examine the indirect effect of the exposure on the primary event through the intermediate event. Under the counterfactual outcome framework, we defined a causal mediation effect using counting process. To assess statistical evidence for the mediation effect, we proposed two tests: an intersection-union test (IUT) and a weighted log-rank test (WLR). The test statistic was developed from a semi-parametric estimator of the mediation effect using a Cox proportional hazards model for the primary event and a series of logistic regression models for the intermediate event. We built a connection between the IUT and WLR. Asymptotic properties of the two tests were derived, and the IUT was determined to be a size [Formula: see text] test and statistically more powerful than the WLR. In numerical simulations, both the model-based IUT and WLR can properly adjust for confounding covariates, and the Type I error rates of the proposed methods are well protected, with the IUT being more powerful than the WLR. Our methods demonstrate the strongly significant effects of hepatitis B or C on the risk of liver cancer mediated through liver cirrhosis incidence in a prospective cohort study. The proposed method is also applicable to surrogate endpoint analyses in clinical trials.
Collapse
Affiliation(s)
- Yun-Lin Ho
- Institute of Applied Mathematical Sciences, National Taiwan University, Taipei, Taiwan
| | - Ju-Sheng Hong
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
| | - Yen-Tsung Huang
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan.
| |
Collapse
|
5
|
Wen F, Li C, Liang B, You J, Li X, Wang J, Liu H, Wang F, Dong Z, Zhang Y. Efficacy of high-dose-rate brachytherapy with different radiation source activities among cervical cancer patients and risk factors for long-term outcomes: A 6-year retrospective study. Brachytherapy 2024; 23:35-44. [PMID: 37919124 DOI: 10.1016/j.brachy.2023.09.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 07/20/2023] [Accepted: 09/14/2023] [Indexed: 11/04/2023]
Abstract
PURPOSE This study aimed to assess the impact of dose rates due to natural decay of Iridium-192 sources and the risk factors of clinical outcomes for cervical cancer patients treated with high-dose-rate (HDR) brachytherapy. METHODS AND MATERIALS Four ninety-four patients were divided into relatively-high-radioactive (rHR), relatively-medium-radioactive (rMR), and relatively-low-radioactive (rLR) groups for retrospective treatment response comparison. The short-term outcomes were evaluated using the 1-month /3-month follow-up results based on RECIST 1.1. Local recurrence-free survival (LRFS) and metastatic recurrence-free survival (MRFS) were selected as long-term outcomes. A class of transformation models with adaptive lasso was applied to assess the risk factors of long-term outcomes. RESULTS No significant difference was identified in short- or long-term outcomes of different radioactive groups. Subgroup analyses demonstrated similar findings. In multivariate factor analysis, advanced stage was significantly associated with higher risk of local recurrence and metastatic recurrence (HR = 1.66, 95%confidence interval [CI] = 1.14-2.43, p = 0.008; HR = 1.57, 95%CI = 1.23-2.00, p < 0.001). Significant associations were observed between local recurrence and pathology, and between metastatic recurrence and pre-treatment serum indices, respectively (HR = 8.62, 95%CI = 2.28-32.60, p = 0.002; HR = 1.98, 95%CI=1.20-2.26, p = 0.008). CONCLUSIONS Overall, there was no significant difference in long- or short-term efficacy of the HDR brachytherapy among the groups with different levels of activity of radiation sources. Stage, pathology, and pretreatment serum indices were crucial factors that affected the long-term outcomes.
Collapse
Affiliation(s)
- Fengyu Wen
- Department of Health Data Science, Institute of Medical Technology, Peking University Health Science Center, Beijing, China
| | - Chenguang Li
- Department of Radiation Oncology Physics, Institute of Medical Technology, Peking University, Beijing, China; Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing, China
| | - Baosheng Liang
- Department of Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Jing You
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing, China
| | - Xiaofan Li
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing, China
| | - Jingyuan Wang
- Department of Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Hongjia Liu
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing, China
| | - Fulin Wang
- Department of Health Data Science, Institute of Medical Technology, Peking University Health Science Center, Beijing, China
| | - Zhengkun Dong
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing, China
| | - Yibao Zhang
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing, China.
| |
Collapse
|
6
|
Li C, Tian Y, Zeng D, Shepherd BE. Asymptotic Properties for Cumulative Probability Models for Continuous Outcomes. MATHEMATICS (BASEL, SWITZERLAND) 2023; 11:4896. [PMID: 38374966 PMCID: PMC10875740 DOI: 10.3390/math11244896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]
Abstract
Regression models for continuous outcomes frequently require a transformation of the outcome, which is often specified a priori or estimated from a parametric family. Cumulative probability models (CPMs) nonparametrically estimate the transformation by treating the continuous outcome as if it is ordered categorically. They thus represent a flexible analysis approach for continuous outcomes. However, it is difficult to establish asymptotic properties for CPMs due to the potentially unbounded range of the transformation. Here we show asymptotic properties for CPMs when applied to slightly modified data where bounds, one lower and one upper, are chosen and the outcomes outside the bounds are set as two ordinal categories. We prove the uniform consistency of the estimated regression coefficients and of the estimated transformation function between the bounds. We also describe their joint asymptotic distribution, and show that the estimated regression coefficients attain the semiparametric efficiency bound. We show with simulations that results from this approach and those from using the CPM on the original data are very similar when a small fraction of the data are modified. We reanalyze a dataset of HIV-positive patients with CPMs to illustrate and compare the approaches.
Collapse
Affiliation(s)
- Chun Li
- Division of Biostatistics, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
| | - Yuqi Tian
- Department of Biostatistics, Vanderbilt University, Nashville, TN 37203, USA
| | - Donglin Zeng
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Bryan E. Shepherd
- Department of Biostatistics, Vanderbilt University, Nashville, TN 37203, USA
| |
Collapse
|
7
|
Ning X, Pan Y, Sun Y, Gilbert PB. A semiparametric Cox-Aalen transformation model with censored data. Biometrics 2023; 79:3111-3125. [PMID: 37403227 PMCID: PMC10764654 DOI: 10.1111/biom.13895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 05/31/2023] [Indexed: 07/06/2023]
Abstract
We propose a broad class of so-called Cox-Aalen transformation models that incorporate both multiplicative and additive covariate effects on the baseline hazard function within a transformation. The proposed models provide a highly flexible and versatile class of semiparametric models that include the transformation models and the Cox-Aalen model as special cases. Specifically, it extends the transformation models by allowing potentially time-dependent covariates to work additively on the baseline hazard and extends the Cox-Aalen model through a predetermined transformation function. We propose an estimating equation approach and devise an expectation-solving (ES) algorithm that involves fast and robust calculations. The resulting estimator is shown to be consistent and asymptotically normal via modern empirical process techniques. The ES algorithm yields a computationally simple method for estimating the variance of both parametric and nonparametric estimators. Finally, we demonstrate the performance of our procedures through extensive simulation studies and applications in two randomized, placebo-controlled human immunodeficiency virus (HIV) prevention efficacy trials. The data example shows the utility of the proposed Cox-Aalen transformation models in enhancing statistical power for discovering covariate effects.
Collapse
Affiliation(s)
- Xi Ning
- Department of Mathematics and Statistics, University of North Carolina at Charlotte, Charlotte, North Carolina, U.S.A
| | - Yinghao Pan
- Department of Mathematics and Statistics, University of North Carolina at Charlotte, Charlotte, North Carolina, U.S.A
| | - Yanqing Sun
- Department of Mathematics and Statistics, University of North Carolina at Charlotte, Charlotte, North Carolina, U.S.A
| | - Peter B. Gilbert
- Department of Biostatistics, University of Washington, Seattle, Washington, U.S.A
- Vaccine and Infectious Disease and Public Health Sciences Divisions, Fred Hutchinson Cancer Center, Seattle, Washington, U.S.A
| |
Collapse
|
8
|
Eden SK, Li C, Shepherd BE. Spearman-like correlation measure adjusting for covariates in bivariate survival data. Biom J 2023; 65:e2200137. [PMID: 37753794 PMCID: PMC10897866 DOI: 10.1002/bimj.202200137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 02/15/2023] [Accepted: 05/08/2023] [Indexed: 09/28/2023]
Abstract
We propose an extension of Spearman's correlation for censored continuous and discrete data that permits covariate adjustment. Previously proposed nonparametric and semiparametric Spearman's correlation estimators require either nonparametric estimation of the bivariate survival surface or parametric assumptions about the dependence structure. In practice, nonparametric estimation of the bivariate survival surface is difficult, and parametric assumptions about the correlation structure may not be satisfied. Therefore, we propose a method that requires neither and uses only the marginal survival distributions. Our method estimates the correlation of probability-scale residuals, which has been shown to equal Spearman's correlation when there is no censoring. Because this method relies only on marginal distributions, it tends to be less variable than the previously suggested nonparametric estimators, and the confidence intervals are easily constructed. Although under censoring, it is biased for Spearman's correlation as our simulations show, it performs well under moderate censoring, with a smaller mean squared error than nonparametric approaches. We also extend it to partial (adjusted), conditional, and partial-conditional correlation, which makes it particularly relevant for practical applications. We apply our method to estimate the correlation between time to viral failure and time to regimen change in a multisite cohort of persons living with HIV in Latin America.
Collapse
Affiliation(s)
- Svetlana K. Eden
- Department of Biostatistics, Vanderbilt University Medical Center, 2525 West End Avenue, Suite 11000, Nashville, TN 37203, USA
| | - Chun Li
- Department of Population and Public Health Sciences, University of Southern California, 2001 North Soto Street, Los Angeles, CA 90033, USA
| | - Bryan E. Shepherd
- Department of Biostatistics, Vanderbilt University Medical Center, 2525 West End Avenue, Suite 11000, Nashville, TN 37203, USA
| |
Collapse
|
9
|
Yin Lee C, Wong KY. Survival analysis with a random change-point. Stat Methods Med Res 2023; 32:2083-2095. [PMID: 37559549 DOI: 10.1177/09622802231192946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/11/2023]
Abstract
Contemporary works in change-point survival models mainly focus on an unknown universal change-point shared by the whole study population. However, in some situations, the change-point is plausibly individual-specific, such as when it corresponds to the telomere length or menopausal age. Also, maximum-likelihood-based inference for the fixed change-point parameter is notoriously complicated. The asymptotic distribution of the maximum-likelihood estimator is non-standard, and computationally intensive bootstrap techniques are commonly used to retrieve its sampling distribution. This article is motivated by a breast cancer study, where the disease-free survival time of the patients is postulated to be regulated by the menopausal age, which is unobserved. As menopausal age varies across patients, a fixed change-point survival model may be inadequate. Therefore, we propose a novel proportional hazards model with a random change-point. We develop a nonparametric maximum-likelihood estimation approach and devise a stable expectation-maximization algorithm to compute the estimators. Because the model is regular, we employ conventional likelihood theory for inference based on the asymptotic normality of the Euclidean parameter estimators, and the variance of the asymptotic distribution can be consistently estimated by a profile-likelihood approach. A simulation study demonstrates the satisfactory finite-sample performance of the proposed methods, which yield small bias and proper coverage probabilities. The methods are applied to the motivating breast cancer study.
Collapse
Affiliation(s)
- Chun Yin Lee
- Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong
| | - Kin Yau Wong
- Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong
- Department of Applied Mathematics, Hong Kong Polytechnic University Shenzhen Research Institute, Hong Kong
| |
Collapse
|
10
|
Xu Y, Zeng D, Lin DY. Marginal proportional hazards models for multivariate interval-censored data. Biometrika 2023; 110:815-830. [PMID: 37601305 PMCID: PMC10434824 DOI: 10.1093/biomet/asac059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/22/2023] Open
Abstract
Multivariate interval-censored data arise when there are multiple types of events or clusters of study subjects, such that the event times are potentially correlated and when each event is only known to occur over a particular time interval. We formulate the effects of potentially time-varying covariates on the multivariate event times through marginal proportional hazards models while leaving the dependence structures of the related event times unspecified. We construct the nonparametric pseudolikelihood under the working assumption that all event times are independent, and we provide a simple and stable EM-type algorithm. The resulting nonparametric maximum pseudolikelihood estimators for the regression parameters are shown to be consistent and asymptotically normal, with a limiting covariance matrix that can be consistently estimated by a sandwich estimator under arbitrary dependence structures for the related event times. We evaluate the performance of the proposed methods through extensive simulation studies and present an application to data from the Atherosclerosis Risk in Communities Study.
Collapse
Affiliation(s)
- Yangjianchen Xu
- Department of Biostatistics, University of North Carolina, 3101E McGavran-Greenberg Hall, Chapel Hill, North Carolina 27599, U.S.A
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina, 3101E McGavran-Greenberg Hall, Chapel Hill, North Carolina 27599, U.S.A
| | - D Y Lin
- Department of Biostatistics, University of North Carolina, 3101E McGavran-Greenberg Hall, Chapel Hill, North Carolina 27599, U.S.A
| |
Collapse
|
11
|
Cheng YJ, Liu YC, Tsai CY, Huang CY. Semiparametric estimation of the transformation model by leveraging external aggregate data in the presence of population heterogeneity. Biometrics 2023; 79:1996-2009. [PMID: 36314375 DOI: 10.1111/biom.13778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 10/05/2022] [Indexed: 09/13/2023]
Abstract
Leveraging information in aggregate data from external sources to improve estimation efficiency and prediction accuracy with smaller scale studies has drawn a great deal of attention in recent years. Yet, conventional methods often either ignore uncertainty in the external information or fail to account for the heterogeneity between internal and external studies. This article proposes an empirical likelihood-based framework to improve the estimation of the semiparametric transformation models by incorporating information about the t-year subgroup survival probability from external sources. The proposed estimation procedure incorporates an additional likelihood component to account for uncertainty in the external information and employs a density ratio model to characterize population heterogeneity. We establish the consistency and asymptotic normality of the proposed estimator and show that it is more efficient than the conventional pseudopartial likelihood estimator without combining information. Simulation studies show that the proposed estimator yields little bias and outperforms the conventional approach even in the presence of information uncertainty and heterogeneity. The proposed methodologies are illustrated with an analysis of a pancreatic cancer study.
Collapse
Affiliation(s)
- Yu-Jen Cheng
- Institute of Statistics, National Tsing Hua University, Hsin-Chu, Taiwan
| | - Yen-Chun Liu
- Institute of Statistics, National Tsing Hua University, Hsin-Chu, Taiwan
| | - Chang-Yu Tsai
- Institute of Statistics, National Tsing Hua University, Hsin-Chu, Taiwan
| | - Chiung-Yu Huang
- Department of Epidemiology & Biostatistics, University of California at San Francisco, San Francisco, California, USA
| |
Collapse
|
12
|
Zhong W, Diao G. Joint semiparametric models for case-cohort designs. Biometrics 2023; 79:1959-1971. [PMID: 35917392 DOI: 10.1111/biom.13728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Accepted: 07/20/2022] [Indexed: 11/28/2022]
Abstract
Two-phase studies such as case-cohort and nested case-control studies are widely used cost-effective sampling strategies. In the first phase, the observed failure/censoring time and inexpensive exposures are collected. In the second phase, a subgroup of subjects is selected for measurements of expensive exposures based on the information from the first phase. One challenging issue is how to utilize all the available information to conduct efficient regression analyses of the two-phase study data. This paper proposes a joint semiparametric modeling of the survival outcome and the expensive exposures. Specifically, we assume a class of semiparametric transformation models and a semiparametric density ratio model for the survival outcome and the expensive exposures, respectively. The class of semiparametric transformation models includes the proportional hazards model and the proportional odds model as special cases. The density ratio model is flexible in modeling multivariate mixed-type data. We develop efficient likelihood-based estimation and inference procedures and establish the large sample properties of the nonparametric maximum likelihood estimators. Extensive numerical studies reveal that the proposed methods perform well under practical settings. The proposed methods also appear to be reasonably robust under various model mis-specifications. An application to the National Wilms Tumor Study is provided.
Collapse
Affiliation(s)
- Weibin Zhong
- Global Biometrics & Data Sciences, Bristol Myers Squibb, Berkeley Heights, New Jersey, USA
| | - Guoqing Diao
- Department of Biostatistics and Bioinformatics, The George Washington University, Washington, District of Columbia, USA
| |
Collapse
|
13
|
Yang Y, Yao F, Zhao P. Online Smooth Backfitting for Generalized Additive Models. J Am Stat Assoc 2023. [DOI: 10.1080/01621459.2023.2182213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Affiliation(s)
- Ying Yang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Fang Yao
- Department of Probability and Statistics, School of Mathematical Sciences, Center for Statistical Science, Peking University, Beijing, China
| | - Peng Zhao
- School of Mathematics and Statistics and Research Institute of Mathematical Sciences(RIMS), Jiangsu Normal University, Xuzhou, China
| |
Collapse
|
14
|
He Y, Kim S, Mao L, Woo Ahn K. Marginal semiparametric transformation models for clustered multivariate competing risks data. Stat Med 2022; 41:5349-5364. [PMID: 36117139 PMCID: PMC9650627 DOI: 10.1002/sim.9573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 08/29/2022] [Accepted: 08/31/2022] [Indexed: 11/10/2022]
Abstract
Multivariate survival models are often used in studying multiple outcomes for right-censored data. However, the outcomes of interest often have competing risks, where standard multivariate survival models may lead to invalid inferences. For example, patients who had stem cell transplantation may experience multiple types of infections after transplant while reconstituting their immune system, where death without experiencing infections is a competing risk for infections. Such competing risks data often suffer from cluster effects due to a matched pair design or correlation within study centers. The cumulative incidence function (CIF) is widely used to summarize competing risks outcomes. Thus, it is often of interest to study direct covariate effects on the CIF. Most literature on clustered competing risks data analyses is limited to the univariate proportional subdistribution hazards model with inverse probability censoring weighting which requires correctly specifying the censoring distribution. We propose a marginal semiparametric transformation model for multivariate competing risks outcomes. The proposed model does not require modeling the censoring distribution, accommodates nonproportional subdistribution hazards structure, and provides a platform for joint inference of all causes and outcomes.
Collapse
Affiliation(s)
- Yizeng He
- Division of Biostatistics, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| | - Soyoung Kim
- Division of Biostatistics, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| | - Lu Mao
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Kwang Woo Ahn
- Division of Biostatistics, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| |
Collapse
|
15
|
Bouzebda S, El-hadjali T, Ferfache AA. Central limit theorems for functional Z-estimators with functional nuisance parameters. COMMUN STAT-THEOR M 2022. [DOI: 10.1080/03610926.2022.2138439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Salim Bouzebda
- Université de technologie de Compiègne, LMAC (Laboratory of Applied Mathematics of Compiègne), Compiègne Cedex
| | - Thouria El-hadjali
- Département de Mathématiques, Université Frères Mentouri, Constantine 1, Algérie
| | | |
Collapse
|
16
|
Wang J, Zeng D, Lin DY. Semiparametric single-index models for optimal treatment regimens with censored outcomes. LIFETIME DATA ANALYSIS 2022; 28:744-763. [PMID: 35939142 DOI: 10.1007/s10985-022-09566-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Accepted: 06/07/2022] [Indexed: 06/15/2023]
Abstract
There is a growing interest in precision medicine, where a potentially censored survival time is often the most important outcome of interest. To discover optimal treatment regimens for such an outcome, we propose a semiparametric proportional hazards model by incorporating the interaction between treatment and a single index of covariates through an unknown monotone link function. This model is flexible enough to allow non-linear treatment-covariate interactions and yet provides a clinically interpretable linear rule for treatment decision. We propose a sieve maximum likelihood estimation approach, under which the baseline hazard function is estimated nonparametrically and the unknown link function is estimated via monotone quadratic B-splines. We show that the resulting estimators are consistent and asymptotically normal with a covariance matrix that attains the semiparametric efficiency bound. The optimal treatment rule follows naturally as a linear combination of the maximum likelihood estimators of the model parameters. Through extensive simulation studies and an application to an AIDS clinical trial, we demonstrate that the treatment rule derived from the single-index model outperforms the treatment rule under the standard Cox proportional hazards model.
Collapse
Affiliation(s)
- Jin Wang
- Department of Biostatistics, University Of North Carolina, Chapel Hill, NC, United States
| | - Donglin Zeng
- Department of Biostatistics, University Of North Carolina, Chapel Hill, NC, United States
| | - D Y Lin
- Department of Biostatistics, University Of North Carolina, Chapel Hill, NC, United States.
| |
Collapse
|
17
|
Jiang Q, Xia Y, Liang B. Matching distributions for survival data. CAN J STAT 2022. [DOI: 10.1002/cjs.11641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Qiang Jiang
- School of Statistics Southwestern University of Finance and Economics Chengdu P.R. China
| | - Yifan Xia
- Institute of Medical Technology Peking University Beijing P.R. China
| | - Baosheng Liang
- Department of Biostatistics School of Public Health, Peking University Beijing P.R. China
| |
Collapse
|
18
|
Qiu Z, Ma H, Shi J. Reweighting estimators for the transformation models with length-biased sampling data and missing covariates. COMMUN STAT-THEOR M 2022. [DOI: 10.1080/03610926.2020.1812653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Zhiping Qiu
- School of Statistics, Huaqiao University, Xiamen, China
- Research Center for Applied Statistics and Big Data, Huaqiao University, Xiamen, China
| | - Huijuan Ma
- Key Laboratory of Advanced Theory and Application in Statistics and Data Science, Ministry of Education, East China Normal University, Shanghai, China
- Academy of Statistics and Interdisciplinary Sciences, East China Normal University, Shanghai, China
| | - Jianhua Shi
- School of Mathematics and Statistics, Minnan Normal University, Zhangzhou, China
| |
Collapse
|
19
|
Pan D, Song X, Pan J. Joint analysis of multivariate failure time data with latent variables. Stat Methods Med Res 2022; 31:1292-1312. [DOI: 10.1177/09622802221089028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
We propose a joint modeling approach to investigate the observed and latent risk factors of the multivariate failure times of interest. The proposed model comprises two parts. The first part is a distribution-free confirmatory factor analysis model that characterizes the latent factors by correlated multiple observed variables. The second part is a multivariate additive hazards model that assesses the observed and latent risk factors of the failure times. A hybrid procedure that combines the borrow-strength estimation approach and the asymptotically distribution-free generalized least square method is developed to estimate the model parameters. The asymptotic properties of the proposed estimators are derived. Simulation studies demonstrate that the proposed method performs well for practical settings. An application to a study concerning the risk factors of multiple diabetic complications is provided.
Collapse
Affiliation(s)
- Deng Pan
- School of Mathematics and Statistics, Huazhong University of Science and Technology, Wuhan, China
| | - Xinyuan Song
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, China
| | - Junhao Pan
- Department of Psychology, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
20
|
Tang W, He K, Xu G, Zhu J. Survival Analysis via Ordinary Differential Equations. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2051519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Weijing Tang
- Department of Statistics, University of Michigan, Ann Arbor, Michigan
| | - Kevin He
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, Michigan
| | - Gongjun Xu
- Department of Statistics, University of Michigan, Ann Arbor, Michigan
| | - Ji Zhu
- Department of Statistics, University of Michigan, Ann Arbor, Michigan
| |
Collapse
|
21
|
Jiang H, Huang L, Xia Y. Nonparametric regression with right‐censored covariate via conditional density function. Stat Med 2022; 41:2025-2051. [DOI: 10.1002/sim.9343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 12/19/2021] [Accepted: 01/17/2022] [Indexed: 11/11/2022]
Affiliation(s)
- Hui Jiang
- School of Mathematics and Statistics Huazhong University of Science and Technology Wuhan China
| | - Lei Huang
- School of Mathematics Southwest Jiaotong University Chengdu China
| | - Yingcun Xia
- Department of Statistics and Data Science National University of Singapore Singapore
- School of Mathematics University of Electronic Science and Technology of China Chengdu China
| |
Collapse
|
22
|
WONG KINYAU, ZENG DONGLIN, LIN DY. SEMIPARAMETRIC LATENT-CLASS MODELS FOR MULTIVARIATE LONGITUDINAL AND SURVIVAL DATA. Ann Stat 2022; 50:487-510. [PMID: 35813218 PMCID: PMC9269993 DOI: 10.1214/21-aos2117] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
In long-term follow-up studies, data are often collected on repeated measures of multivariate response variables as well as on time to the occurrence of a certain event. To jointly analyze such longitudinal data and survival time, we propose a general class of semiparametric latent-class models that accommodates a heterogeneous study population with flexible dependence structures between the longitudinal and survival outcomes. We combine nonparametric maximum likelihood estimation with sieve estimation and devise an efficient EM algorithm to implement the proposed approach. We establish the asymptotic properties of the proposed estimators through novel use of modern empirical process theory, sieve estimation theory, and semiparametric efficiency theory. Finally, we demonstrate the advantages of the proposed methods through extensive simulation studies and provide an application to the Atherosclerosis Risk in Communities study.
Collapse
Affiliation(s)
- KIN YAU WONG
- Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong
| | - DONGLIN ZENG
- Department of Biostatistics, University of North Carolina at Chapel Hill, USA
| | - D. Y. LIN
- Department of Biostatistics, University of North Carolina at Chapel Hill, USA
| |
Collapse
|
23
|
Wang C, Jiang J, Song X. Bayesian transformation models with partly interval-censored data. Stat Med 2021; 41:1263-1279. [PMID: 34845732 DOI: 10.1002/sim.9271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 10/27/2021] [Accepted: 11/04/2021] [Indexed: 11/07/2022]
Abstract
In many scientific fields, partly interval-censored data, which consist of exactly observed and interval-censored observations on the failure time of interest, appear frequently. However, methodological developments in the analysis of partly interval-censored data are relatively limited and have mainly focused on additive or proportional hazards models. The general linear transformation model provides a highly flexible modeling framework that includes several familiar survival models as special cases. Despite such nice features, the inference procedure for this class of models has not been developed for partly interval-censored data. We propose a fully Bayesian approach coped with efficient Markov chain Monte Carlo methods to fill this gap. A four-stage data augmentation procedure is introduced to tackle the challenges presented by the complex model and data structure. The proposed method is easy to implement and computationally attractive. The empirical performance of the proposed method is evaluated through two simulation studies, and the model is then applied to a dental health study.
Collapse
Affiliation(s)
- Chunjie Wang
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, China
| | - Jingjing Jiang
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, China
| | - Xinyuan Song
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
24
|
Sun L, Li S, Wang L, Song X. A semiparametric mixture model approach for regression analysis of partly interval-censored data with a cured subgroup. Stat Methods Med Res 2021; 30:1890-1903. [PMID: 34197261 DOI: 10.1177/09622802211023985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Failure time data with a cured subgroup are frequently confronted in various scientific fields and many methods have been proposed for their analysis under right or interval censoring. However, a cure model approach does not seem to exist in the analysis of partly interval-censored data, which consist of both exactly observed and interval-censored observations on the failure time of interest. In this article, we propose a two-component mixture cure model approach for analyzing such type of data. We employ a logistic model to describe the cured probability and a proportional hazards model to model the latent failure time distribution for uncured subjects. We consider maximum likelihood estimation and develop a new expectation-maximization algorithm for its implementation. The asymptotic properties of the resulting estimators are established and the finite sample performance of the proposed method is examined through simulation studies. An application to a set of real data on childhood mortality in Nigeria is provided.
Collapse
Affiliation(s)
- Liuquan Sun
- School of Economics and Statistics, Guangzhou University, Guangzhou, China.,Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Shuwei Li
- School of Economics and Statistics, Guangzhou University, Guangzhou, China
| | - Lianming Wang
- Department of Statistics, University of South Carolina, Columbia, USA
| | - Xinyuan Song
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong
| |
Collapse
|
25
|
Wei B, Peng L, Zhang MJ, Fine JP. Estimation of causal quantile effects with a binary instrumental variable and censored data. J R Stat Soc Series B Stat Methodol 2021; 83:559-578. [PMID: 35444487 PMCID: PMC9015211 DOI: 10.1111/rssb.12431] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The causal effect of a treatment is of fundamental interest in the social, biological, and health sciences. Instrumental variable (IV) methods are commonly used to determine causal treatment effects in the presence of unmeasured confounding. In this work, we study a new binary IV framework with randomly censored outcomes where we propose to quantify the causal treatment effect by the concept of complier quantile causal effect (CQCE). The CQCE is identifiable under weaker conditions than the complier average causal effect when outcomes are subject to censoring, and it can provide useful insight into the dynamics of the causal treatment effect. Employing the special characteristic of the binary IV and adapting the principle of conditional score, we uncover a simple weighting scheme that can be incorporated into the standard censored quantile regression procedure to estimate CQCE. We develop robust nonparametric estimation of the derived weights in the first stage, which permits stable implementation of the second stage estimation based on existing software. We establish rigorous asymptotic properties for the proposed estimator, and confirm its validity and satisfactory finite-sample performance via extensive simulations. The proposed method is applied to a bone marrow transplant dataset to evaluate the causal effect of rituximab in diffuse large B-cell lymphoma patients.
Collapse
Affiliation(s)
- Bo Wei
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, USA
| | - Limin Peng
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, USA
| | - Mei-Jie Zhang
- Department of Biostatistics, Medical College of Wisconsin
| | - Jason P. Fine
- Department of Biostatistics, University of North Carolina-Chapel Hill
| |
Collapse
|
26
|
Wu J, Lu X, Zhong W. Bi-level variable selection in semiparametric transformation mixture cure models for right-censored data. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2021.1926499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Jingjing Wu
- Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada
| | - Xuewen Lu
- Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada
| | - Wenyan Zhong
- Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada
- Department of Biostatistics and Research Decision Sciences, MSD China, Shanghai, China
| |
Collapse
|
27
|
Choi S, Huang X. Efficient inferences for linear transformation models with doubly censored data. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2019.1662046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
- Sangbum Choi
- Department of Statistics, Korea University, Seoul, South Korea
| | - Xuelin Huang
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| |
Collapse
|
28
|
Zhang Y, Han X, Shao Y. The ROC of Cox proportional hazards cure models with application in cancer studies. LIFETIME DATA ANALYSIS 2021; 27:195-215. [PMID: 33507457 DOI: 10.1007/s10985-021-09516-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Accepted: 01/13/2021] [Indexed: 06/12/2023]
Abstract
With recent advancement in cancer screening and treatment, many patients with cancers are identified at early stage and clinically cured. Importantly, uncured patients should be treated timely before the cancer progresses to advanced stages for which therapeutic options are rather limited. It is also crucial to identify uncured subjects among patients with early-stage cancers for clinical trials to develop effective adjuvant therapies. Thus, it is of interest to develop statistical predictive models with as high accuracy as possible in predicting the latent cure status. The receiver operating characteristic curve (ROC) and the area under the ROC curve (AUC) are among the most widely used statistical metrics for assessing predictive accuracy or discriminatory power for a dichotomous outcome (cured/uncured). Yet the conventional AUC cannot be directly used due to incompletely observed cure status. In this article, we proposed new estimates of the ROC curve and its AUC for predicting latent cure status in Cox proportional hazards (PH) cure models and transformation cure models. We developed explicit formulas to estimate sensitivity, specificity, the ROC and its AUC without requiring to know the patient cure status. We also developed EM type estimates to approximate sensitivity, specificity, ROC and AUC conditional on observed data. Numerical studies were used to assess their finite-sample performance of the proposed methods. Both methods are consistent and have similar efficiency as shown in our numerical studies. A melanoma dataset was used to demonstrate the utility of the proposed estimates of the ROC curve for the latent cure status. We also have developed an [Formula: see text] package called [Formula: see text] to efficiently compute the proposed estimates.
Collapse
Affiliation(s)
- Yilong Zhang
- Department of Biostatistics and Research Decision Sciences, Merck & Co., Inc, Kenilworth, NJ, USA
| | - Xiaoxia Han
- Department of Public Health Sciences, Henry Ford Health System, Detroit, MI, USA
| | - Yongzhao Shao
- Departments of Population Health & Environmental Medicine, NYU Grossman School of Medicine, 180 Madison Ave, 4th Floor, Suite 455, New York, NY, 10016, USA.
| |
Collapse
|
29
|
Chang YM, Shen PS, Tang YH. Confidence interval for the difference between two median survival times with semiparametric transformation models. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2018.1563156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Yu-Mei Chang
- Department of Statistics, Tunghai University, Taichung, Taiwan
| | - Pao-Sheng Shen
- Department of Statistics, Tunghai University, Taichung, Taiwan
| | - Yu-Hsin Tang
- Department of Statistics, Tunghai University, Taichung, Taiwan
| |
Collapse
|
30
|
De Neve J, Dehaene H. Semiparametric linear transformation models for indirectly observed outcomes. Stat Med 2021; 40:2286-2303. [PMID: 33565108 DOI: 10.1002/sim.8903] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 01/20/2021] [Accepted: 01/21/2021] [Indexed: 11/06/2022]
Abstract
We propose a regression framework to analyze outcomes that are indirectly observed via one or multiple proxies. Semiparametric transformation models, including Cox proportional hazards regression, turn out to be well suited to model the association between the covariates and the unobserved outcome. By coupling this regression model to a semiparametric measurement model, we can estimate these associations without requiring calibration data and without imposing strong functional assumptions on the relationship between the unobserved outcome and its proxy. When multiple proxies are available, we propose a data-driven aggregation resulting in an improved proxy. We empirically validate the proposed methodology in a simulation study, revealing good finite sample properties, especially when multiple proxies are aggregated. The methods are demonstrated on two case studies.
Collapse
Affiliation(s)
- Jan De Neve
- Department of Data Analysis, Ghent University, Ghent, Belgium
| | | |
Collapse
|
31
|
Chen CM, Shen PS, Liu Y. On semiparametric transformation model with LTRC data: pseudo likelihood approach. Stat Pap (Berl) 2021. [DOI: 10.1007/s00362-018-01080-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
32
|
Demarqui FN, Mayrink VD. Yang and Prentice model with piecewise exponential baseline distribution for modeling lifetime data with crossing survival curves. BRAZ J PROBAB STAT 2021. [DOI: 10.1214/20-bjps471] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
33
|
Zhong W, Lu X, Wu J. Bi-level variable selection in semiparametric transformation models with right-censored data. Comput Stat 2021. [DOI: 10.1007/s00180-021-01075-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
34
|
Liu Y, Lu M, McMahan CS. A penalized likelihood approach for efficiently estimating a partially linear additive transformation model with current status data. Electron J Stat 2021. [DOI: 10.1214/21-ejs1820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Yan Liu
- School of Community Health Sciences, University of Nevada, Reno, Reno, NV, USA
| | - Minggen Lu
- School of Community Health Sciences, University of Nevada, Reno, Reno, NV, USA
| | | |
Collapse
|
35
|
Piancastelli LSC, Barreto-Souza W, Mayrink VD. Generalized inverse-Gaussian frailty models with application to TARGET neuroblastoma data. ANN I STAT MATH 2020. [DOI: 10.1007/s10463-020-00774-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
36
|
Lee CY, Wong KY, Lam KF, Xu J. Analysis of clustered interval‐censored data using a class of semiparametric partly linear frailty transformation models. Biometrics 2020; 78:165-178. [DOI: 10.1111/biom.13399] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 10/19/2020] [Accepted: 10/22/2020] [Indexed: 11/30/2022]
Affiliation(s)
- Chun Yin Lee
- Department of Applied Mathematics The Hong Kong Polytechnic University Hung Hom, Kowloon Hong Kong People's Republic of China
| | - Kin Yau Wong
- Department of Applied Mathematics The Hong Kong Polytechnic University Hung Hom, Kowloon Hong Kong People's Republic of China
| | - K. F. Lam
- Department of Statistics and Actuarial Science The University of Hong Kong Hong Kong People's Republic of China
| | - Jinfeng Xu
- Department of Statistics and Actuarial Science The University of Hong Kong Hong Kong People's Republic of China
| |
Collapse
|
37
|
On semiparametric modelling, estimation and inference for survival data subject to dependent censoring. Biometrika 2020. [DOI: 10.1093/biomet/asaa095] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Summary
When modelling survival data, it is common to assume that the survival time $T$ is conditionally independent of the censoring time $C$ given a set of covariates. However, there are numerous situations in which this assumption is not realistic. The goal of this paper is therefore to develop a semiparametric normal transformation model which assumes that, after a proper nonparametric monotone transformation, the vector $(T, C)$ follows a linear model, and the vector of errors in this bivariate linear model follows a standard bivariate normal distribution with a possibly nondiagonal covariance matrix. We show that this semiparametric model is identifiable, and propose estimators of the nonparametric transformation, the regression coefficients and the correlation between the error terms. It is shown that the estimators of the model parameters and the transformation are consistent and asymptotically normal. We also assess the finite-sample performance of the proposed method by comparing it with an estimation method under a fully parametric model. Finally, our method is illustrated using data from the AIDS Clinical Trial Group 175 study.
Collapse
|
38
|
Lung function over the life course of paediatric and adult patients with cystic fibrosis from a large multi-centre registry. Sci Rep 2020; 10:17421. [PMID: 33060788 PMCID: PMC7567842 DOI: 10.1038/s41598-020-74502-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Accepted: 09/29/2020] [Indexed: 12/24/2022] Open
Abstract
A key measure of lung function in people with Cystic Fibrosis (CF) is Forced Expiratory Volume in the first second FEV1 percent predicted (FEV1pp). This study aimed to address challenges in identifying predictors of FEV1pp, specifically dealing with non-linearity and the censoring effect of death. Data was obtained from a large multi-centre Australian Cystic Fibrosis Data Registry (ACFDR). A linear mixed model was used to study FEV1pp as the endpoint. There were 3655 patients (52.4% male) included in our study. Restricted cubic splines were used to fit the non-linear relationship between age of visit and FEV1pp. The following predictors were found to be significant in the multivariate model: age of patient at visit, BMI z-score, age interaction with lung transplantation, insulin dependent diabetes, cirrhosis/portal hypertension, pancreatic insufficiency, Pseudomonas aeruginosa infection and baseline variability in FEV1pp. Those with P. aeruginosa infection had a lower mean difference in FEV1pp of 4.7 units, p < 0.001 compared to those who did not have the infection. Joint modelling with mortality outcome did not materially affect our findings. These models will prove useful for to study the impact of CFTR modulator therapies on rate of change of lung function among patients with CF.
Collapse
|
39
|
Huang CH, Chen YH, Wang JL, Wang M. Semiparametric copula-based analysis for treatment effects in the presence of treatment switching. Stat Med 2020; 39:2936-2948. [PMID: 32578241 DOI: 10.1002/sim.8585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Revised: 03/23/2020] [Accepted: 04/30/2020] [Indexed: 11/11/2022]
Abstract
In controlled trials, "treatment switching" occurs when patients in one treatment group switch to alternative treatments during the trial, and poses challenges to treatment effect evaluation owing to crossover of the treatments groups. In this work, we assume that treatment switching can occur after some disease progression event and view the progression and death events as two semicompeting risks. The proposed model consists of a copula model for the joint distribution of time-to-progression (TTP) and overall survival (OS) up to the earlier of the two events, as well as a conditional hazard model for OS subsequent to progression. The copula model facilitates assessing the marginal distributions of TTP and OS separately from the association between the two events, and, in particular, the treatment effect on OS in the absence of treatment switching. The proposed conditional hazard model for death subsequent to progression allows us to assess the treatment switching (crossover) effect on OS given occurrence of progression and covariates. Semiparametric proportional hazards models are employed in the marginal models for TTP and OS. A nonparametric maximum likelihood procedure is developed for model inference, which is verified through asymptotic theory and simulation studies. The proposed analysis is applied to a lung cancer dataset to illustrate its real utility.
Collapse
Affiliation(s)
- Chia-Hui Huang
- Department of Statistics, National Chengchi University, Taipei, Taiwan
| | - Yi-Hau Chen
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
| | - Jinn-Li Wang
- Division of Hematology Oncology, Department of Pediatrics, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan.,Department of Pediatrics, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
| | - Mey Wang
- Center for Drug Evaluation, Taipei, Taiwan
| |
Collapse
|
40
|
Xiaowen D, Libin J, Yuzhu T, Maozai T, Manlai T. Quantile regression for panel data models with fixed effects under random censoring. COMMUN STAT-THEOR M 2020. [DOI: 10.1080/03610926.2019.1601221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Dai Xiaowen
- School of Statistics and Mathematics, Shanghai Lixin University of Accounting and Finance, Shanghai, China
| | - Jin Libin
- School of Statistics and Mathematics, Shanghai Lixin University of Accounting and Finance, Shanghai, China
| | - Tian Yuzhu
- School of Mathematics and Statistics, Henan University of Science and Technology, Luoyang, China
| | - Tian Maozai
- School of Statistics, Renmin University of China, Beijing, China
- School of Statistics, Lanzhou University of Finance and Economics, Lanzhou, China
| | - Tang Manlai
- Department of Mathematics and Statistics, Hang Seng University of Hong Kong, Hong Kong, China
| |
Collapse
|
41
|
Zheng Y, Zhao X, Zhang X. A novel approach to estimate the Cox model with temporal covariates and application to medical cost data. COMMUN STAT-THEOR M 2020. [DOI: 10.1080/03610926.2019.1602651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Yanqiao Zheng
- School of Finance, Zhejiang University of Finance and Economics, Hangzhou, China
| | - Xiaobing Zhao
- School of Data Science, Zhejiang University of Finance and Economics Hangzhou, China
| | - Xiaoqi Zhang
- School of Finance, Zhejiang University of Finance and Economics, Hangzhou, China
| |
Collapse
|
42
|
Likelihood-based estimation of a semiparametric time-dependent jump diffusion model of the short-term interest rate. Comput Stat 2020. [DOI: 10.1007/s00180-019-00875-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
43
|
Shin S, Liu Y, Cole SR, Fine JP. Ensemble estimation and variable selection with semiparametric regression models. Biometrika 2020; 107:433-448. [PMID: 32454529 DOI: 10.1093/biomet/asaa012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Indexed: 11/14/2022] Open
Abstract
We consider scenarios in which the likelihood function for a semiparametric regression model factors into separate components, with an efficient estimator of the regression parameter available for each component. An optimal weighted combination of the component estimators, named an ensemble estimator, may be employed as an overall estimate of the regression parameter, and may be fully efficient under uncorrelatedness conditions. This approach is useful when the full likelihood function may be difficult to maximize, but the components are easy to maximize. It covers settings where the nuisance parameter may be estimated at different rates in the component likelihoods. As a motivating example we consider proportional hazards regression with prospective doubly censored data, in which the likelihood factors into a current status data likelihood and a left-truncated right-censored data likelihood. Variable selection is important in such regression modelling, but the applicability of existing techniques is unclear in the ensemble approach. We propose ensemble variable selection using the least squares approximation technique on the unpenalized ensemble estimator, followed by ensemble re-estimation under the selected model. The resulting estimator has the oracle property such that the set of nonzero parameters is successfully recovered and the semiparametric efficiency bound is achieved for this parameter set. Simulations show that the proposed method performs well relative to alternative approaches. Analysis of an AIDS cohort study illustrates the practical utility of the method.
Collapse
Affiliation(s)
- Sunyoung Shin
- Department of Mathematical Sciences, University of Texas at Dallas, 800 W. Campbell Rd., Richardson, Texas 75080, U.S.A
| | - Yufeng Liu
- Department of Statistics and Operations Research, CB# 3260, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A
| | - Stephen R Cole
- Department of Epidemiology, CB# 7435, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A
| | - Jason P Fine
- Department of Biostatistics, CB# 7420, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A
| |
Collapse
|
44
|
Tian Y, Hothorn T, Li C, Harrell FE, Shepherd BE. An empirical comparison of two novel transformation models. Stat Med 2020; 39:562-576. [PMID: 31808976 PMCID: PMC7537829 DOI: 10.1002/sim.8425] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2019] [Revised: 10/16/2019] [Accepted: 10/20/2019] [Indexed: 12/28/2022]
Abstract
Continuous response variables are often transformed to meet modeling assumptions, but the choice of the transformation can be challenging. Two transformation models have recently been proposed: semiparametric cumulative probability models (CPMs) and parametric most likely transformation models (MLTs). Both approaches model the cumulative distribution function and require specifying a link function, which implicitly assumes that the responses follow a known distribution after some monotonic transformation. However, the two approaches estimate the transformation differently. With CPMs, an ordinal regression model is fit, which essentially treats each continuous response as a unique category and therefore nonparametrically estimates the transformation; CPMs are semiparametric linear transformation models. In contrast, with MLTs, the transformation is parameterized using flexible basis functions. Conditional expectations and quantiles are readily derived from both methods on the response variable's original scale. We compare the two methods with extensive simulations. We find that both methods generally have good performance with moderate and large sample sizes. MLTs slightly outperformed CPMs in small sample sizes under correct models. CPMs tended to be somewhat more robust to model misspecification and outcome rounding. Except in the simplest situations, both methods outperform basic transformation approaches commonly used in practice. We apply both methods to an HIV biomarker study.
Collapse
Affiliation(s)
- Yuqi Tian
- Department of Biostatistics, Vanderbilt University, Nashville, TN, USA
| | - Torsten Hothorn
- Institut für Epidemiologie, Biostatistik und Prävention, Universität Zürich, Zürich, Switzerland
| | - Chun Li
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA
| | - Frank E. Harrell
- Department of Biostatistics, Vanderbilt University, Nashville, TN, USA
| | - Bryan E. Shepherd
- Department of Biostatistics, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
45
|
Hirose Y, Liu I. Statistical Generalized Derivative Applied to the Profile Likelihood Estimation in a Mixture of Semiparametric Models. ENTROPY (BASEL, SWITZERLAND) 2020; 22:e22030278. [PMID: 33286050 PMCID: PMC7516731 DOI: 10.3390/e22030278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 02/19/2020] [Accepted: 02/25/2020] [Indexed: 06/12/2023]
Abstract
There is a difficulty in finding an estimate of the standard error (SE) of the profile likelihood estimator in the joint model of longitudinal and survival data. The difficulty is on the differentiation of an implicit function that appear in the profile likelihood estimation. We solve the difficulty by introducing the "statistical generalized derivative". The derivative is used to show the asymptotic normality of the estimator with the SE expressed in terms of the profile likelihood score function.
Collapse
|
46
|
Su L, Lu W, Song R, Huang D. Testing and Estimation of Social Network Dependence With Time to Event Data. J Am Stat Assoc 2020; 115:1-28. [PMID: 34012183 DOI: 10.1080/01621459.2019.1617153] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Nowadays, events are spread rapidly along social networks. We are interested in whether people's responses to an event are affected by their friends' characteristics. For example, how soon will a person start playing a game given that his/her friends like it? Studying social network dependence is an emerging research area. In this work, we propose a novel latent spatial autocorrelation Cox model to study social network dependence with time-to-event data. The proposed model introduces a latent indicator to characterize whether a person's survival time might be affected by his or her friends' features. We first propose a score-type test for detecting the existence of social network dependence. If it exists, we further develop an EM-type algorithm to estimate the model parameters. The performance of the proposed test and estimators are illustrated by simulation studies and an application to a time-to-event dataset about playing a popular mobile game from one of the largest online social network platforms. Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.
Collapse
Affiliation(s)
- Lin Su
- Department of Statistics, North Carolina State University, Raleigh, NC
| | - Wenbin Lu
- Department of Statistics, North Carolina State University, Raleigh, NC
| | - Rui Song
- Department of Statistics, North Carolina State University, Raleigh, NC
| | - Danyang Huang
- School of Statistics, Remin University, Beijing, China
| |
Collapse
|
47
|
Burke K, Eriksson F, Pipper CB. Semiparametric multiparameter regression survival modeling. Scand Stat Theory Appl 2019. [DOI: 10.1111/sjos.12416] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Kevin Burke
- Department of Mathematics and Statistics University of Limerick
| | | | | |
Collapse
|
48
|
Garcia TP, Parast L. Dynamic landmark prediction for mixture data. Biostatistics 2019; 22:558-574. [PMID: 31758793 PMCID: PMC8286554 DOI: 10.1093/biostatistics/kxz052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Revised: 10/27/2019] [Accepted: 10/30/2019] [Indexed: 11/13/2022] Open
Abstract
In kin-cohort studies, clinicians want to provide their patients with the most current cumulative risk of death arising from a rare deleterious mutation. Estimating the cumulative risk is difficult when the genetic mutation status is unknown and only estimated probabilities of a patient having the mutation are available. We estimate the cumulative risk for this scenario using a novel nonparametric estimator that incorporates covariate information and dynamic landmark prediction. Our estimator has improved prediction accuracy over existing estimators that ignore covariate information. It is built within a dynamic landmark prediction framework whereby we can obtain personalized dynamic predictions over time. Compared to current standards, a simple transformation of our estimator provides more efficient estimates of marginal distribution functions in settings where patient-specific predictions are not the main goal. We show our estimator is unbiased and has more predictive accuracy compared to methods that ignore covariate information and landmarking. Applying our method to a Huntington disease study of mortality, we develop dynamic survival prediction curves incorporating gender and familial genetic information.
Collapse
Affiliation(s)
- Tanya P Garcia
- Department of Statistics, Texas A&M University, 3143 TAMU, College Station, TX 77843-3143, USA and RAND Corporation, 1776 Main Street, Santa Monica, CA 90401, USA
| | - Layla Parast
- Department of Statistics, Texas A&M University, 3143 TAMU, College Station, TX 77843-3143, USA and RAND Corporation, 1776 Main Street, Santa Monica, CA 90401, USA
| |
Collapse
|
49
|
Li S, Wu Q, Sun J. Penalized estimation of semiparametric transformation models with interval-censored data and application to Alzheimer's disease. Stat Methods Med Res 2019; 29:2151-2166. [PMID: 31718478 DOI: 10.1177/0962280219884720] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Variable selection or feature extraction is fundamental to identify important risk factors from a large number of covariates and has applications in many fields. In particular, its applications in failure time data analysis have been recognized and many methods have been proposed for right-censored data. However, developing relevant methods for variable selection becomes more challenging when one confronts interval censoring that often occurs in practice. In this article, motivated by an Alzheimer's disease study, we develop a variable selection method for interval-censored data with a general class of semiparametric transformation models. Specifically, a novel penalized expectation-maximization algorithm is developed to maximize the complex penalized likelihood function, which is shown to perform well in the finite-sample situation through a simulation study. The proposed methodology is then applied to the interval-censored data arising from the Alzheimer's disease study mentioned above.
Collapse
Affiliation(s)
- Shuwei Li
- School of Economics and Statistics, Guangzhou University, Guangzhou, China
| | - Qiwei Wu
- Department of Statistics, University of Missouri, Columbia, MO, USA
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, MO, USA
| |
Collapse
|
50
|
Affiliation(s)
- Shuwei Li
- School of Economics and Statistics, Guangzhou University, Guangzhou, People's Republic of China
| | - Tao Hu
- School of Mathematical Sciences, Capital Normal University, Beijing, People's Republic of China
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, MO, USA
| |
Collapse
|