1
|
Lou Y, Ma Y, Du M. A new and unified method for regression analysis of interval-censored failure time data under semiparametric transformation models with missing covariates. Stat Med 2024; 43:2062-2082. [PMID: 38757695 DOI: 10.1002/sim.10035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 01/25/2024] [Accepted: 01/30/2024] [Indexed: 05/18/2024]
Abstract
This paper discusses regression analysis of interval-censored failure time data arising from semiparametric transformation models in the presence of missing covariates. Although some methods have been developed for the problem, they either apply only to limited situations or may have some computational issues. Corresponding to these, we propose a new and unified two-step inference procedure that can be easily implemented using the existing or standard software. The proposed method makes use of a set of working models to extract partial information from incomplete observations and yields a consistent estimator of regression parameters assuming missing at random. An extensive simulation study is conducted and indicates that it performs well in practical situations. Finally, we apply the proposed approach to an Alzheimer's Disease study that motivated this study.
Collapse
Affiliation(s)
- Yichen Lou
- School of Mathematics, Jilin University, Changchun, China
| | - Yuqing Ma
- School of Mathematics, Jilin University, Changchun, China
| | - Mingyue Du
- School of Mathematics, Jilin University, Changchun, China
| |
Collapse
|
2
|
Wang C, Du M. Martingale-residual-based greedy model averaging for high-dimensional current status data. Stat Med 2024; 43:1726-1742. [PMID: 38381059 DOI: 10.1002/sim.10037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 12/08/2023] [Accepted: 02/01/2024] [Indexed: 02/22/2024]
Abstract
Current status data are a type of failure time data that arise when the failure time of study subject cannot be determined precisely but is known only to occur before or after a random monitoring time. Variable selection methods for the failure time data have been discussed extensively in the literature. However, the statistical inference of the model selected based on the variable selection method ignores the uncertainty caused by model selection. To enhance the prediction accuracy for risk quantities such as survival probability, we propose two optimal model averaging methods under semiparametric additive hazards models. Specifically, based on martingale residuals processes, a delete-one cross-validation (CV) process is defined, and two new CV functional criteria are derived for choosing model weights. Furthermore, we present a greedy algorithm for the implementation of the techniques, and the asymptotic optimality of the proposed model averaging approaches is established, along with the convergence of the greedy averaging algorithms. A series of simulation experiments demonstrate the effectiveness and superiority of the proposed methods. Finally, a real-data example is provided as an illustration.
Collapse
Affiliation(s)
- Chang Wang
- School of Mathematics, Jilin University, Changchun, China
| | - Mingyue Du
- School of Mathematics, Jilin University, Changchun, China
| |
Collapse
|
3
|
Wu Y, Zhao H, Sun J. Group variable selection for the Cox model with interval-censored failure time data. Biometrics 2023; 79:3082-3095. [PMID: 37211860 DOI: 10.1111/biom.13879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 04/28/2023] [Indexed: 05/23/2023]
Abstract
Group variable selection is often required in many areas, and for this many methods have been developed under various situations. Unlike the individual variable selection, the group variable selection can select the variables in groups, and it is more efficient to identify both important and unimportant variables or factors by taking into account the existing group structure. In this paper, we consider the situation where one only observes interval-censored failure time data arising from the Cox model, for which there does not seem to exist an established method. More specifically, a penalized sieve maximum likelihood variable selection and estimation procedure is proposed and the oracle property of the proposed method is established. Also, an extensive simulation study is performed and suggests that the proposed approach works well in practical situations. An application of the method to a set of real data is provided.
Collapse
Affiliation(s)
- Yuxiang Wu
- Department of Statistics, University of Missouri, Columbia, Missouri, USA
| | - Hui Zhao
- School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, China
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, Missouri, USA
| |
Collapse
|
4
|
Ge L, Liang B, Hu T, Sun J, Zhao S, Li Y. Variable selection for mixed panel count data under the proportional mean model. Stat Methods Med Res 2023; 32:1728-1748. [PMID: 37401336 DOI: 10.1177/09622802231184637] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/05/2023]
Abstract
Mixed panel count data have attracted increasing attention in medical research based on event history studies. When such data arise, one either observes the number of event occurrences or only knows whether the event has happened or not over an observation period. In this article, we discuss variable selection in event history studies given such complex data, for which there does not seem to exist an established procedure. For the problem, we propose a penalized likelihood variable selection procedure and for the implementation, an expectation-maximization algorithm is developed with the use of the coordinate descent algorithm in the M-step. Furthermore, the oracle property of the proposed method is established, and a simulation study is performed and indicates that the proposed method works well in practical scenarios. Finally, the method is applied to identify the risk factors associated with medical non-adherence arising from the Sequenced Treatment Alternatives to Relieve Depression Study.
Collapse
Affiliation(s)
- Lei Ge
- Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Baosheng Liang
- Department of Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Tao Hu
- School of Mathematical Sciences, Capital Normal University, Beijing, China
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, MO, USA
| | - Shishun Zhao
- Applied Statistical Research Center, School of Mathematics, Jilin University, Changchun, China
| | - Yang Li
- Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN, USA
| |
Collapse
|
5
|
Lu F, Huang X, Lu X, Tian G, Yang J. Model detection for semiparametric accelerated failure additive model with right-censored data. Stat Methods Med Res 2023; 32:1527-1542. [PMID: 37338958 DOI: 10.1177/09622802231181224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/21/2023]
Abstract
Censored data frequently appeared in applications across a variety of different areas like epidemiology or medical research. Traditionally statistical inference on this data mechanism was based on some pre-assigned models that will suffer from the risk of model-misspecification. This article proposes a two-folded shrinkage procedure for simultaneous structure identification and variable selection of the semiparametric accelerated failure additive model with right-censored data, in which the nonparametric functions are addressed by spline approximation. Under some regularity conditions, the consistency of model structure identification is theoretically established in the sense that the proposed method can automatically separate the linear and zero components from the nonlinear ones with probability approaching to one. Detailed issues in computation and turning parameter selection are also discussed. Finally, we illustrate the proposed method by some simulation studies and two real data applications to the primary biliary cirrhosis data and skin cutaneous melanoma data.
Collapse
Affiliation(s)
- Fang Lu
- MOE-LCSM, School of Mathematics and Statistics, Hunan Normal University, Changsha, China
| | - Xiaoyan Huang
- MOE-LCSM, School of Mathematics and Statistics, Hunan Normal University, Changsha, China
| | - Xuewen Lu
- Department of Mathematics and Statistics, University of Calgary, Canada
| | - Guoliang Tian
- Department of Statistics and Data Science, Southern University of Science and Technology, Shenzhen, China Fang Lu and Xiaoyan Huang are joint first authors
| | - Jing Yang
- MOE-LCSM, School of Mathematics and Statistics, Hunan Normal University, Changsha, China
| |
Collapse
|
6
|
Sun T, Li Y, Xiao Z, Ding Y, Wang X. Semiparametric copula method for semi-competing risks data subject to interval censoring and left truncation: Application to disability in elderly. Stat Methods Med Res 2023; 32:656-670. [PMID: 36735020 PMCID: PMC11070129 DOI: 10.1177/09622802221133552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
We aim to evaluate the marginal effects of covariates on time-to-disability in the elderly under the semi-competing risks framework, as death dependently censors disability, not vice versa. It becomes particularly challenging when time-to-disability is subject to interval censoring due to intermittent assessments. A left truncation issue arises when the age time scale is applied. We develop a flexible two-parameter copula-based semiparametric transformation model for semi-competing risks data subject to interval censoring and left truncation. The two-parameter copula quantifies both upper and lower tail dependence between two margins. The semiparametric transformation models incorporate proportional hazards and proportional odds models in both margins. We propose a two-step sieve maximum likelihood estimation procedure and study the sieve estimators' asymptotic properties. Simulations show that the proposed method corrects biases in the marginal method. We demonstrate the proposed method in a large-scale Chinese Longitudinal Healthy Longevity Study and provide new insights into preventing disability in the elderly. The proposed method could be applied to the general semi-competing risks data with intermittently assessed disease status.
Collapse
Affiliation(s)
- Tao Sun
- Center for Applied Statistics and School of Statistics, Renmin University of China, Beijing, China
| | - Yunlong Li
- Center for Applied Statistics and School of Statistics, Renmin University of China, Beijing, China
| | - Zhengyan Xiao
- Center for Applied Statistics and School of Statistics, Renmin University of China, Beijing, China
| | - Ying Ding
- Department of Biostatistics, University of Pittsburgh, PA, USA
| | - Xiaojun Wang
- Center for Applied Statistics and School of Statistics, Renmin University of China, Beijing, China
| |
Collapse
|
7
|
Friedrich S, Groll A, Ickstadt K, Kneib T, Pauly M, Rahnenführer J, Friede T. Regularization approaches in clinical biostatistics: A review of methods and their applications. Stat Methods Med Res 2023; 32:425-440. [PMID: 36384320 PMCID: PMC9896544 DOI: 10.1177/09622802221133557] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
A range of regularization approaches have been proposed in the data sciences to overcome overfitting, to exploit sparsity or to improve prediction. Using a broad definition of regularization, namely controlling model complexity by adding information in order to solve ill-posed problems or to prevent overfitting, we review a range of approaches within this framework including penalization, early stopping, ensembling and model averaging. Aspects of their practical implementation are discussed including available R-packages and examples are provided. To assess the extent to which these approaches are used in medicine, we conducted a review of three general medical journals. It revealed that regularization approaches are rarely applied in practical clinical applications, with the exception of random effects models. Hence, we suggest a more frequent use of regularization approaches in medical research. In situations where also other approaches work well, the only downside of the regularization approaches is increased complexity in the conduct of the analyses which can pose challenges in terms of computational resources and expertise on the side of the data analyst. In our view, both can and should be overcome by investments in appropriate computing facilities and educational resources.
Collapse
Affiliation(s)
- Sarah Friedrich
- Institute of Mathematics, University of
Augsburg, Augsburg, Germany
- Centre for Advanced Analytics and Predictive Sciences, University of
Augsburg, Augsburg, Germany
| | - Andreas Groll
- Department of Statistics, TU Dortmund
University, Dortmund, Germany
| | - Katja Ickstadt
- Department of Statistics, TU Dortmund
University, Dortmund, Germany
| | - Thomas Kneib
- Chair of Statistics and Campus Institute Data Science,
Georg-August-University Göttingen,
Göttingen, Germany
| | - Markus Pauly
- Department of Statistics, TU Dortmund
University, Dortmund, Germany
| | | | - Tim Friede
- Department of Medical Statistics, University Medical Center
Göttingen, Göttingen, Germany
- DZHK (German Center for Cardiovascular Research), partner site
Göttingen, Göttingen, Germany
| |
Collapse
|
8
|
Tian T, Sun J. Variable selection for nonparametric additive Cox model with interval-censored data. Biom J 2023; 65:e2100310. [PMID: 35923136 DOI: 10.1002/bimj.202100310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Revised: 05/05/2022] [Accepted: 05/28/2022] [Indexed: 01/17/2023]
Abstract
The standard Cox model is perhaps the most commonly used model for regression analysis of failure time data but it has some limitations such as the assumption on linear covariate effects. To relax this, the nonparametric additive Cox model, which allows for nonlinear covariate effects, is often employed, and this paper will discuss variable selection and structure estimation for this general model. For the problem, we propose a penalized sieve maximum likelihood approach with the use of Bernstein polynomials approximation and group penalization. To implement the proposed method, an efficient group coordinate descent algorithm is developed and can be easily carried out for both low- and high-dimensional scenarios. Furthermore, a simulation study is performed to assess the performance of the presented approach and suggests that it works well in practice. The proposed method is applied to an Alzheimer's disease study for identifying important and relevant genetic factors.
Collapse
Affiliation(s)
- Tian Tian
- Department of Statistics, University of Missouri, Columbia, USA, MO
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, USA, MO
| |
Collapse
|
9
|
Luo L, Yu J, Zhao H. The sparse estimation of the semiparametric linear transformation model with dependent current status data. J Appl Stat 2022; 51:759-779. [PMID: 38414802 PMCID: PMC10896163 DOI: 10.1080/02664763.2022.2161488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 12/18/2022] [Indexed: 12/31/2022]
Abstract
In this paper, we study the sparse estimation under the semiparametric linear transformation models for the current status data, also called type I interval-censored data. For the problem, the failure time of interest may be dependent on the censoring time and the association parameter between them is left unspecified. To address this, we employ the copula model to describe the dependence between them and a two-stage estimation procedure to estimate both the association parameter and the regression parameter. In addition, we propose a penalized maximum likelihood estimation procedure based on the broken adaptive ridge regression, and Bernstein polynomials are used to approximate the nonparametric functions involved. The oracle property of the proposed method is established and the numerical studies suggest that the method works well for practical situations. Finally, the method is applied to an Alzheimer's disease study that motivated this investigation.
Collapse
Affiliation(s)
- Lin Luo
- College of Science, Zhongyuan University of Technology, Zhengzhou, People's Republic of China
| | - Jinzhao Yu
- School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, People's Republic of China
| | - Hui Zhao
- School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, People's Republic of China
| |
Collapse
|
10
|
Sun L, Li S, Wang L, Song X, Sui X. Simultaneous variable selection in regression analysis of multivariate interval-censored data. Biometrics 2022; 78:1402-1413. [PMID: 34407218 DOI: 10.1111/biom.13548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 05/13/2021] [Accepted: 08/03/2021] [Indexed: 12/30/2022]
Abstract
Multivariate interval-censored data arise when each subject under study can potentially experience multiple events and the onset time of each event is not observed exactly but is known to lie in a certain time interval formed by adjacent examination times with changed statuses of the event. This type of incomplete and complex data structure poses a substantial challenge in practical data analysis. In addition, many potential risk factors exist in numerous studies. Thus, conducting variable selection for event-specific covariates simultaneously becomes useful in identifying important variables and assessing their effects on the events of interest. In this paper, we develop a variable selection technique for multivariate interval-censored data under a general class of semiparametric transformation frailty models. The minimum information criterion (MIC) method is embedded in the optimization step of the proposed expectation-maximization (EM) algorithm to obtain the parameter estimator. The proposed EM algorithm greatly reduces the computational burden in maximizing the observed likelihood function, and the MIC naturally avoids selecting the optimal tuning parameter as needed in many other popular penalties, making the proposed algorithm promising and reliable. The proposed method is evaluated through extensive simulation studies and illustrated by an analysis of patient data from the Aerobics Center Longitudinal Study.
Collapse
Affiliation(s)
- Liuquan Sun
- School of Economics and Statistics, Guangzhou University, Guangzhou, China.,Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Shuwei Li
- School of Economics and Statistics, Guangzhou University, Guangzhou, China
| | - Lianming Wang
- Department of Statistics, University of South Carolina, Columbia, South Carolina, USA
| | - Xinyuan Song
- Department of Statistics, Chinese University of Hong Kong, Hong Kong
| | - Xuemei Sui
- Department of Exercise Science, Arnold School of Public Health, University of South Carolina, Columbia, South Carolina, USA
| |
Collapse
|
11
|
Du M, Zhao X, Sun J. Variable selection for case-cohort studies with informatively interval-censored outcomes. Comput Stat Data Anal 2022. [DOI: 10.1016/j.csda.2022.107484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
12
|
Zhou R, Li H, Sun J, Tang N. A new approach to estimation of the proportional hazards model based on interval-censored data with missing covariates. LIFETIME DATA ANALYSIS 2022; 28:335-355. [PMID: 35352270 DOI: 10.1007/s10985-022-09550-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 02/03/2022] [Indexed: 06/14/2023]
Abstract
This paper discusses the fitting of the proportional hazards model to interval-censored failure time data with missing covariates. Many authors have discussed the problem when complete covariate information is available or the missing is completely at random. In contrast to this, we will focus on the situation where the missing is at random. For the problem, a sieve maximum likelihood estimation approach is proposed with the use of I-spline functions to approximate the unknown cumulative baseline hazard function in the model. For the implementation of the proposed method, we develop an EM algorithm based on a two-stage data augmentation. Furthermore, we show that the proposed estimators of regression parameters are consistent and asymptotically normal. The proposed approach is then applied to a set of the data concerning Alzheimer Disease that motivated this study.
Collapse
Affiliation(s)
- Ruiwen Zhou
- Department of Statistics, University of Missouri, Columbia, MO, 65211, USA
| | - Huiqiong Li
- Department of Statistics, Yunnan University, Kunming, 650091, China.
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, MO, 65211, USA
| | - Niansheng Tang
- Department of Statistics, Yunnan University, Kunming, 650091, China
| |
Collapse
|
13
|
Liu R, Du M, Sun J. Variable selection for bivariate interval-censored failure time data under linear transformation models. Int J Biostat 2022:ijb-2021-0031. [PMID: 35654407 DOI: 10.1515/ijb-2021-0031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2021] [Accepted: 04/20/2022] [Indexed: 11/15/2022]
Abstract
Variable selection is needed and performed in almost every field and a large literature on it has been established, especially under the context of linear models or for complete data. Many authors have also investigated the variable selection problem for incomplete data such as right-censored failure time data. In this paper, we discuss variable selection when one faces bivariate interval-censored failure time data arising from a linear transformation model, for which it does not seem to exist an established procedure. For the problem, a penalized maximum likelihood approach is proposed and in particular, a novel Poisson-based EM algorithm is developed for the implementation. The oracle property of the proposed method is established, and the numerical studies suggest that the method works well for practical situations.
Collapse
Affiliation(s)
- Rong Liu
- Center for Applied Statistical Research, School of Mathematics, Jilin University, Changchun 130012, China
| | - Mingyue Du
- Center for Applied Statistical Research, School of Mathematics, Jilin University, Changchun 130012, China
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, MO, 65211, USA
| |
Collapse
|
14
|
Li C, Xiao L, Luo S. Joint model for survival and multivariate sparse functional data with application to a study of Alzheimer's Disease. Biometrics 2022; 78:435-447. [PMID: 33501651 PMCID: PMC8310894 DOI: 10.1111/biom.13427] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 11/06/2020] [Accepted: 01/08/2021] [Indexed: 11/29/2022]
Abstract
Studies of Alzheimer's disease (AD) often collect multiple longitudinal clinical outcomes, which are correlated and predictive of AD progression. It is of great scientific interest to investigate the association between the outcomes and time to AD onset. We model the multiple longitudinal outcomes as multivariate sparse functional data and propose a functional joint model linking multivariate functional data to event time data. In particular, we propose a multivariate functional mixed model to identify the shared progression pattern and outcome-specific progression patterns of the outcomes, which enables more interpretable modeling of associations between outcomes and AD onset. The proposed method is applied to the Alzheimer's Disease Neuroimaging Initiative study (ADNI) and the functional joint model sheds new light on inference of five longitudinal outcomes and their associations with AD onset. Simulation studies also confirm the validity of the proposed model. Data used in preparation of this article were obtained from the ADNI database.
Collapse
Affiliation(s)
- Cai Li
- Department of Biostatistics, Yale University, New Haven, Connecticut, USA
| | - Luo Xiao
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Sheng Luo
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA
| |
Collapse
|
15
|
Wang W, Fang L, Li S, Sun J. Variable selection for misclassified current status data under the proportional hazards model. COMMUN STAT-SIMUL C 2022. [DOI: 10.1080/03610918.2022.2050391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Affiliation(s)
- Wenshan Wang
- Center for Applied Statistical Research, School of Mathematics, Jilin University, Changchun, China
| | - Lijun Fang
- School of Economics and Statistics, Guangzhou University, Guangzhou, China
| | - Shuwei Li
- School of Economics and Statistics, Guangzhou University, Guangzhou, China
| | - Jianguo Sun
- Department of Statistics, University of Missouri, Columbia, Missouri, USA
| |
Collapse
|
16
|
Variable Selection for Generalized Linear Models with Interval-Censored Failure Time Data. MATHEMATICS 2022. [DOI: 10.3390/math10050763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Variable selection is often needed in many fields and has been discussed by many authors in various situations. This is especially the case under linear models and when one observes complete data. Among others, one common situation where variable selection is required is to identify important risk factors from a large number of covariates. In this paper, we consider the problem when one observes interval-censored failure time data arising from generalized linear models, for which there does not seem to exist an established method. To address this, we propose a penalized least squares method with the use of an unbiased transformation and the oracle property of the method is established along with the asymptotic normality of the resulting estimators of regression parameters. Simulation studies were conducted and demonstrated that the proposed method performed well for practical situations. In addition, the method was applied to a motivating example about children’s mortality data of Nigeria.
Collapse
|
17
|
Affiliation(s)
- Mingyue Du
- Department of Applied Mathematics The Hong Kong Polytechnic University Hong Kong China
| | - Jianguo Sun
- Department of Statistics University of Missouri Columbia MO 65211 USA
| |
Collapse
|
18
|
Xu Y, Zhao S, Hu T, Sun J. Variable selection for generalized odds rate mixture cure models with interval-censored failure time data. Comput Stat Data Anal 2021. [DOI: 10.1016/j.csda.2020.107115] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
19
|
Zhong W, Lu X, Wu J. Bi-level variable selection in semiparametric transformation models with right-censored data. Comput Stat 2021. [DOI: 10.1007/s00180-021-01075-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|