1
|
Morrison S, Gatsonis C, Eloyan A, Steingrimsson JA. Survival analysis using deep learning with medical imaging. Int J Biostat 2024; 20:1-12. [PMID: 37312249 PMCID: PMC11074924 DOI: 10.1515/ijb-2022-0113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 02/24/2023] [Indexed: 06/15/2023]
Abstract
There is widespread interest in using deep learning to build prediction models for medical imaging data. These deep learning methods capture the local structure of the image and require no manual feature extraction. Despite the importance of modeling survival in the context of medical data analysis, research on deep learning methods for modeling the relationship of imaging and time-to-event data is still under-developed. We provide an overview of deep learning methods for time-to-event outcomes and compare several deep learning methods to Cox model based methods through the analysis of a histology dataset of gliomas.
Collapse
Affiliation(s)
- Samantha Morrison
- Department of Biostatistics, School of Public Health, Brown University, Providence, RI, USA
| | - Constantine Gatsonis
- Department of Biostatistics, School of Public Health, Brown University, Providence, RI, USA
| | - Ani Eloyan
- Department of Biostatistics, School of Public Health, Brown University, Providence, RI, USA
| | - Jon Arni Steingrimsson
- Department of Biostatistics, School of Public Health, Brown University, Providence, RI, USA
| |
Collapse
|
2
|
Rhodes G, Davidian M, Lu W. Estimation of optimal treatment regimes with electronic medical record data using the residual life value estimator. Biostatistics 2024:kxae002. [PMID: 38332633 DOI: 10.1093/biostatistics/kxae002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 11/10/2023] [Accepted: 01/03/2024] [Indexed: 02/10/2024] Open
Abstract
Clinicians and patients must make treatment decisions at a series of key decision points throughout disease progression. A dynamic treatment regime is a set of sequential decision rules that return treatment decisions based on accumulating patient information, like that commonly found in electronic medical record (EMR) data. When applied to a patient population, an optimal treatment regime leads to the most favorable outcome on average. Identifying optimal treatment regimes that maximize residual life is especially desirable for patients with life-threatening diseases such as sepsis, a complex medical condition that involves severe infections with organ dysfunction. We introduce the residual life value estimator (ReLiVE), an estimator for the expected value of cumulative restricted residual life under a fixed treatment regime. Building on ReLiVE, we present a method for estimating an optimal treatment regime that maximizes expected cumulative restricted residual life. Our proposed method, ReLiVE-Q, conducts estimation via the backward induction algorithm Q-learning. We illustrate the utility of ReLiVE-Q in simulation studies, and we apply ReLiVE-Q to estimate an optimal treatment regime for septic patients in the intensive care unit using EMR data from the Multiparameter Intelligent Monitoring Intensive Care database. Ultimately, we demonstrate that ReLiVE-Q leverages accumulating patient information to estimate personalized treatment regimes that optimize a clinically meaningful function of residual life.
Collapse
Affiliation(s)
- Grace Rhodes
- Eli Lilly and Company, Indianapolis, IN 46204, USA
| | - Marie Davidian
- Department of Statistics, North Carolina State University, SAS Hall, 2311 Stinson Dr, Raleigh, NC 27607, USA
| | - Wenbin Lu
- Department of Statistics, North Carolina State University, SAS Hall, 2311 Stinson Dr, Raleigh, NC 27607, USA
| |
Collapse
|
3
|
Shah KS, Fu H, Kosorok MR. Stabilized direct learning for efficient estimation of individualized treatment rules. Biometrics 2023; 79:2843-2856. [PMID: 36585916 DOI: 10.1111/biom.13818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 12/23/2022] [Indexed: 01/01/2023]
Abstract
In recent years, the field of precision medicine has seen many advancements. Significant focus has been placed on creating algorithms to estimate individualized treatment rules (ITRs), which map from patient covariates to the space of available treatments with the goal of maximizing patient outcome. Direct learning (D-Learning) is a recent one-step method which estimates the ITR by directly modeling the treatment-covariate interaction. However, when the variance of the outcome is heterogeneous with respect to treatment and covariates, D-Learning does not leverage this structure. Stabilized direct learning (SD-Learning), proposed in this paper, utilizes potential heteroscedasticity in the error term through a residual reweighting which models the residual variance via flexible machine learning algorithms such as XGBoost and random forests. We also develop an internal cross-validation scheme which determines the best residual model among competing models. SD-Learning improves the efficiency of D-Learning estimates in binary and multi-arm treatment scenarios. The method is simple to implement and an easy way to improve existing algorithms within the D-Learning family, including original D-Learning, Angle-based D-Learning (AD-Learning), and Robust D-learning (RD-Learning). We provide theoretical properties and justification of the optimality of SD-Learning. Head-to-head performance comparisons with D-Learning methods are provided through simulations, which demonstrate improvement in terms of average prediction error (APE), misclassification rate, and empirical value, along with a data analysis of an acquired immunodeficiency syndrome (AIDS) randomized clinical trial.
Collapse
Affiliation(s)
- Kushal S Shah
- Department of Biostatistics, University of North Carolina at Chapel Hill, North Carolina, USA
| | - Haoda Fu
- Eli Lilly and Company, Lilly Corporate Center, Indianapolis, USA
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, North Carolina, USA
| |
Collapse
|
4
|
Siriwardhana C, Kulasekera K, Datta S. Selection of the optimal personalized treatment from multiple treatments with right-censored multivariate outcome measures. J Appl Stat 2023; 51:891-912. [PMID: 38524800 PMCID: PMC10956931 DOI: 10.1080/02664763.2022.2164759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Accepted: 12/29/2022] [Indexed: 01/11/2023]
Abstract
We propose a novel personalized concept for the optimal treatment selection for a situation where the response is a multivariate vector that could contain right-censored variables such as survival time. The proposed method can be applied with any number of treatments and outcome variables, under a broad set of models. Following a working semiparametric Single Index Model that relates covariates and responses, we first define a patient-specific composite score, constructed from individual covariates. We then estimate conditional means of each response, given the patient score, correspond to each treatment, using a nonparametric smooth estimator. Next, a rank aggregation technique is applied to estimate an ordering of treatments based on ranked lists of treatment performance measures given by conditional means. We handle the right-censored data by incorporating the inverse probability of censoring weighting to the corresponding estimators. An empirical study illustrates the performance of the proposed method in finite sample problems. To show the applicability of the proposed procedure for real data, we also present a data analysis using HIV clinical trial data, that contained a right-censored survival event as one of the endpoints.
Collapse
Affiliation(s)
- Chathura Siriwardhana
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, Honolulu, HI, USA
| | - K.B. Kulasekera
- Department of Bioinformatics & Biostatistics, University of Louisville, Louisville, KY, USA
| | - Somnath Datta
- Department of Biostatistics, University of Florida, Gainesville, FL, USA
| |
Collapse
|
5
|
Model selection for survival individualized treatment rules using the jackknife estimator. BMC Med Res Methodol 2022; 22:328. [PMID: 36550398 PMCID: PMC9773469 DOI: 10.1186/s12874-022-01811-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 12/01/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Precision medicine is an emerging field that involves the selection of treatments based on patients' individual prognostic data. It is formalized through the identification of individualized treatment rules (ITRs) that maximize a clinical outcome. When the type of outcome is time-to-event, the correct handling of censoring is crucial for estimating reliable optimal ITRs. METHODS We propose a jackknife estimator of the value function to allow for right-censored data for a binary treatment. The jackknife estimator or leave-one-out-cross-validation approach can be used to estimate the value function and select optimal ITRs using existing machine learning methods. We address the issue of censoring in survival data by introducing an inverse probability of censoring weighted (IPCW) adjustment in the expression of the jackknife estimator of the value function. In this paper, we estimate the optimal ITR by using random survival forest (RSF) and Cox proportional hazards model (COX). We use a Z-test to compare the optimal ITRs learned by RSF and COX with the zero-order model (or one-size-fits-all). Through simulation studies, we investigate the asymptotic properties and the performance of our proposed estimator under different censoring rates. We illustrate our proposed method on a phase III clinical trial of non-small cell lung cancer data. RESULTS Our simulations show that COX outperforms RSF for small sample sizes. As sample sizes increase, the performance of RSF improves, in particular when the expected log failure time is not linear in the covariates. The estimator is fairly normally distributed across different combinations of simulation scenarios and censoring rates. When applied to a non-small-cell lung cancer data set, our method determines the zero-order model (ZOM) as the best performing model. This finding highlights the possibility that tailoring may not be needed for this cancer data set. CONCLUSION The jackknife approach for estimating the value function in the presence of right-censored data shows satisfactory performance when there is small to moderate censoring. Winsorizing the upper and lower percentiles of the estimated survival weights for computing the IPCWs stabilizes the estimator.
Collapse
|
6
|
Xu Y, Greene TH, Bress AP, Bellows BK, Zhang Y, Zhang Z, Kolm P, Weintraub WS, Moran AS, Shen J. An efficient approach for optimizing the cost-effective individualized treatment rule using conditional random forest. Stat Methods Med Res 2022; 31:2122-2136. [PMID: 35912490 DOI: 10.1177/09622802221115876] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Evidence from observational studies has become increasingly important for supporting healthcare policy making via cost-effectiveness analyses. Similar as in comparative effectiveness studies, health economic evaluations that consider subject-level heterogeneity produce individualized treatment rules that are often more cost-effective than one-size-fits-all treatment. Thus, it is of great interest to develop statistical tools for learning such a cost-effective individualized treatment rule under the causal inference framework that allows proper handling of potential confounding and can be applied to both trials and observational studies. In this paper, we use the concept of net-monetary-benefit to assess the trade-off between health benefits and related costs. We estimate cost-effective individualized treatment rule as a function of patients' characteristics that, when implemented, optimizes the allocation of limited healthcare resources by maximizing health gains while minimizing treatment-related costs. We employ the conditional random forest approach and identify the optimal cost-effective individualized treatment rule using net-monetary-benefit-based classification algorithms, where two partitioned estimators are proposed for the subject-specific weights to effectively incorporate information from censored individuals. We conduct simulation studies to evaluate the performance of our proposals. We apply our top-performing algorithm to the NIH-funded Systolic Blood Pressure Intervention Trial to illustrate the cost-effectiveness gains of assigning customized intensive blood pressure therapy.
Collapse
Affiliation(s)
- Yizhe Xu
- Department of Population Health Sciences, 7060University of Utah, SLC, UT, USA
| | - Tom H Greene
- Department of Population Health Sciences, 7060University of Utah, SLC, UT, USA
| | - Adam P Bress
- Department of Population Health Sciences, 7060University of Utah, SLC, UT, USA
| | | | - Yue Zhang
- Department of Population Health Sciences, 7060University of Utah, SLC, UT, USA
| | - Zugui Zhang
- 5973Christiana Care Health System, Newark, DE, USA
| | - Paul Kolm
- Department of Medicine, 121577MedStar Health Research Institute, Washington, DC, USA
| | - William S Weintraub
- Department of Medicine, 121577MedStar Health Research Institute, Washington, DC, USA
| | - Andrew S Moran
- 21611Columbia University Medical Center, New York, NY, USA
| | - Jincheng Shen
- Department of Population Health Sciences, 7060University of Utah, SLC, UT, USA
| |
Collapse
|
7
|
Behrouz A, Lécuyer M, Rudin C, Seltzer M. Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design. CEUR WORKSHOP PROCEEDINGS 2022; 3318:26. [PMID: 36970634 PMCID: PMC10039433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
Abstract
Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for prediction, that work does not address policy design, because the algorithms cannot handle weighted data samples. Specifically, they rely on the discreteness of the loss function, which means that real-valued weights cannot be directly used. For example, none of the existing techniques produce policies that incorporate inverse propensity weighting on individual data points. We present three algorithms for efficient sparse weighted decision tree optimization. The first approach directly optimizes the weighted loss function; however, it tends to be computationally inefficient for large datasets. Our second approach, which scales more efficiently, transforms weights to integer values and uses data duplication to transform the weighted decision tree optimization problem into an unweighted (but larger) counterpart. Our third algorithm, which scales to much larger datasets, uses a randomized procedure that samples each data point with a probability proportional to its weight. We present theoretical bounds on the error of the two fast methods and show experimentally that these methods can be two orders of magnitude faster than the direct optimization of the weighted loss, without losing significant accuracy.
Collapse
Affiliation(s)
- Ali Behrouz
- University of British Columbia Vancouver, British Columbia, Canada
| | - Mathias Lécuyer
- University of British Columbia Vancouver, British Columbia, Canada
| | | | - Margo Seltzer
- University of British Columbia Vancouver, British Columbia, Canada
| |
Collapse
|
8
|
Zhou Y, Wang L, Song R, Zhao T. Transformation-Invariant Learning of Optimal Individualized Decision Rules with Time-to-Event Outcomes. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2068420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Yu Zhou
- Roku, San Jose, United States
| | - Lan Wang
- Department of Management Science, University of Miami
| | - Rui Song
- Department of Statistics, North Carolina State University
| | - Tuoyi Zhao
- Department of Management Science, University of Miami
| |
Collapse
|
9
|
Qi Z, Pang JS, Liu Y. On Robustness of Individualized Decision Rules. J Am Stat Assoc 2022; 118:2143-2157. [PMID: 38143785 PMCID: PMC10746134 DOI: 10.1080/01621459.2022.2038180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 01/21/2022] [Indexed: 10/18/2022]
Abstract
With the emergence of precision medicine, estimating optimal individualized decision rules (IDRs) has attracted tremendous attention in many scientific areas. Most existing literature has focused on finding optimal IDRs that can maximize the expected outcome for each individual. Motivated by complex individualized decision making procedures and the popular conditional value at risk (CVaR) measure, we propose a new robust criterion to estimate optimal IDRs in order to control the average lower tail of the individuals' outcomes. In addition to improving the individualized expected outcome, our proposed criterion takes risks into consideration, and thus the resulting IDRs can prevent adverse events. The optimal IDR under our criterion can be interpreted as the decision rule that maximizes the "worst-case" scenario of the individualized outcome when the underlying distribution is perturbed within a constrained set. An efficient non-convex optimization algorithm is proposed with convergence guarantees. We investigate theoretical properties for our estimated optimal IDRs under the proposed criterion such as consistency and finite sample error bounds. Simulation studies and a real data application are used to further demonstrate the robust performance of our methods. Several extensions of the proposed method are also discussed.
Collapse
Affiliation(s)
- Zhengling Qi
- Department of Decision Sciences, George Washington University
| | - Jong-Shi Pang
- Department of Industrial and Systems Engineering, University of Southern California, LA
| | - Yufeng Liu
- Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, Carolina Center for Genome Sciences, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, NC 27599, USA
| |
Collapse
|
10
|
Xu R, Chen G, Connor M, Murphy J. Novel Use of Patient-Specific Covariates From Oncology Studies in the Era of Biomedical Data Science: A Review of Latest Methodologies. J Clin Oncol 2022; 40:3546-3553. [PMID: 35258995 DOI: 10.1200/jco.21.01957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
In this article, we review different applications of how to incorporate individual patient variables into clinical research within oncology. These methodologies range from the more traditional use of baseline covariates from randomized clinical trials, as well as observational studies, to using covariates to generalize the results of randomized clinical trials to other populations. Individual patient variables also allow for the consideration of heterogeneity in treatment effects and individualized treatment rules. We primarily consider two treatment groups and mostly focus on time-to-event outcomes where such methodologies have been well established and widely applied. We also discuss more conceptually newer statistical research that has not been widely applied in clinical oncology, but is likely to make an impact in future oncology research. With the increasing amount of biomedical data available for analysis, it is inevitable that more methods are developed to make best use of information, to advance oncology research.
Collapse
Affiliation(s)
- Ronghui Xu
- Univerity of California, San Diego, San Diego, CA
| | | | | | - James Murphy
- Univerity of California, San Diego, San Diego, CA
| |
Collapse
|
11
|
Xu Y, Greene TH, Bress AP, Sauer BC, Bellows BK, Zhang Y, Weintraub WS, Moran AE, Shen J. Estimating the optimal individualized treatment rule from a cost-effectiveness perspective. Biometrics 2022; 78:337-351. [PMID: 33215693 PMCID: PMC8134511 DOI: 10.1111/biom.13406] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 10/30/2020] [Accepted: 11/06/2020] [Indexed: 11/27/2022]
Abstract
Optimal individualized treatment rules (ITRs) provide customized treatment recommendations based on subject characteristics to maximize clinical benefit in accordance with the objectives in precision medicine. As a result, there is growing interest in developing statistical tools for estimating optimal ITRs in evidence-based research. In health economic perspectives, policy makers consider the tradeoff between health gains and incremental costs of interventions to set priorities and allocate resources. However, most work on ITRs has focused on maximizing the effectiveness of treatment without considering costs. In this paper, we jointly consider the impact of effectiveness and cost on treatment decisions and define ITRs under a composite-outcome setting, so that we identify the most cost-effective ITR that accounts for individual-level heterogeneity through direct optimization. In particular, we propose a decision-tree-based statistical learning algorithm that uses a net-monetary-benefit-based reward to provide nonparametric estimations of the optimal ITR. We provide several approaches to estimating the reward underlying the ITR as a function of subject characteristics. We present the strengths and weaknesses of each approach and provide practical guidelines by comparing their performance in simulation studies. We illustrate the top-performing approach from our simulations by evaluating the projected 15-year personalized cost-effectiveness of the intensive blood pressure control of the Systolic Blood Pressure Intervention Trial (SPRINT) study.
Collapse
Affiliation(s)
- Yizhe Xu
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah
| | - Tom H. Greene
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah,Department of Internal Medicine, University of Utah, Salt Lake City, Utah,Department of Family and Preventive Medicine, University of Utah, Salt Lake City, Utah
| | - Adam P. Bress
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah
| | - Brian C. Sauer
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah,Department of Family and Preventive Medicine, University of Utah, Salt Lake City, Utah,Salt Lake City Veterans Affairs Medical Center, Salt Lake City, Utah
| | - Brandon K. Bellows
- Department of Medicine, Columbia University Medical Center, New York, New York
| | - Yue Zhang
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah,Department of Internal Medicine, University of Utah, Salt Lake City, Utah,Department of Family and Preventive Medicine, University of Utah, Salt Lake City, Utah
| | | | - Andrew E. Moran
- Department of Medicine, Columbia University Medical Center, New York, New York
| | - Jincheng Shen
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah,Department of Internal Medicine, University of Utah, Salt Lake City, Utah,Department of Family and Preventive Medicine, University of Utah, Salt Lake City, Utah
| |
Collapse
|
12
|
Gerber G, Faou YL, Lopez O, Trupin M. The Impact of Churn on Client Value in Health Insurance, Evaluation Using a Random Forest Under Various Censoring Mechanisms. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2020.1764364] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
| | - Yohann Le Faou
- Forsides & Sorbonne Université, CNRS, Laboratoire de Probabilités, Statistique et Modélisation, LPSM, Paris, France
| | - Olivier Lopez
- Sorbonne Université, CNRS, Laboratoire de Probabilités, Statistique et Modélisation, LPSM, Paris, France
| | | |
Collapse
|
13
|
Survival Augmented Patient Preference Incorporated Reinforcement Learning to Evaluate Tailoring Variables for Personalized Healthcare. STATS 2021. [DOI: 10.3390/stats4040046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
In this paper, we consider personalized treatment decision strategies in the management of chronic diseases, such as chronic kidney disease, which typically consists of sequential and adaptive treatment decision making. We investigate a two-stage treatment setting with a survival outcome that could be right censored. This can be formulated through a dynamic treatment regime (DTR) framework, where the goal is to tailor treatment to each individual based on their own medical history in order to maximize a desirable health outcome. We develop a new method, Survival Augmented Patient Preference incorporated reinforcement Q-Learning (SAPP-Q-Learning) to decide between quality of life and survival restricted at maximal follow-up. Our method incorporates the latent patient preference into a weighted utility function that balances between quality of life and survival time, in a Q-learning model framework. We further propose a corresponding m-out-of-n Bootstrap procedure to accurately make statistical inferences and construct confidence intervals on the effects of tailoring variables, whose values can guide personalized treatment strategies.
Collapse
|
14
|
Zhang P, Ma J, Chen X, Shentu Y. A nonparametric method for value function guided subgroup identification via gradient tree boosting for censored survival data. Stat Med 2020; 39:4133-4146. [PMID: 32786155 DOI: 10.1002/sim.8714] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2020] [Revised: 06/08/2020] [Accepted: 07/09/2020] [Indexed: 11/07/2022]
Abstract
In randomized clinical trials with survival outcome, there has been an increasing interest in subgroup identification based on baseline genomic, proteomic markers, or clinical characteristics. Some of the existing methods identify subgroups that benefit substantially from the experimental treatment by directly modeling outcomes or treatment effect. When the goal is to find an optimal treatment for a given patient rather than finding the right patient for a given treatment, methods under the individualized treatment regime framework estimate an individualized treatment rule that would lead to the best expected clinical outcome as measured by a value function. Connecting the concept of value function to subgroup identification, we propose a nonparametric method that searches for subgroup membership scores by maximizing a value function that directly reflects the subgroup-treatment interaction effect based on restricted mean survival time. A gradient tree boosting algorithm is proposed to search for the individual subgroup membership scores. We conduct simulation studies to evaluate the performance of the proposed method and an application to an AIDS clinical trial is performed for illustration.
Collapse
Affiliation(s)
- Pingye Zhang
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Junshui Ma
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Xinqun Chen
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Yue Shentu
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| |
Collapse
|
15
|
Zhang H, Huang J, Sun L. A rank-based approach to estimating monotone individualized two treatment regimes. Comput Stat Data Anal 2020. [DOI: 10.1016/j.csda.2020.107015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
16
|
Wu Y, Wang L. Resampling-based confidence intervals for model-free robust inference on optimal treatment regimes. Biometrics 2020; 77:465-476. [PMID: 32687215 DOI: 10.1111/biom.13337] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 06/24/2020] [Indexed: 12/01/2022]
Abstract
We propose a new procedure for inference on optimal treatment regimes in the model-free setting, which does not require to specify an outcome regression model. Existing model-free estimators for optimal treatment regimes are usually not suitable for the purpose of inference, because they either have nonstandard asymptotic distributions or do not necessarily guarantee consistent estimation of the parameter indexing the Bayes rule due to the use of surrogate loss. We first study a smoothed robust estimator that directly targets the parameter corresponding to the Bayes decision rule for optimal treatment regimes estimation. This estimator is shown to have an asymptotic normal distribution. Furthermore, we verify that a resampling procedure provides asymptotically accurate inference for both the parameter indexing the optimal treatment regime and the optimal value function. A new algorithm is developed to calculate the proposed estimator with substantially improved speed and stability. Numerical results demonstrate the satisfactory performance of the new methods.
Collapse
Affiliation(s)
- Yunan Wu
- School of Statistics, University of Minnesota, Minneapolis, Minnesota
| | - Lan Wang
- Department of Management Science, University of Miami, Coral Gables, Florida
| |
Collapse
|
17
|
Cui Y, Tchetgen ET. A semiparametric instrumental variable approach to optimal treatment regimes under endogeneity. J Am Stat Assoc 2020; 116:162-173. [PMID: 33994604 PMCID: PMC8118566 DOI: 10.1080/01621459.2020.1783272] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2019] [Revised: 02/05/2020] [Accepted: 06/09/2020] [Indexed: 01/23/2023]
Abstract
There is a fast-growing literature on estimating optimal treatment regimes based on randomized trials or observational studies under a key identifying condition of no unmeasured confounding. Because confounding by unmeasured factors cannot generally be ruled out with certainty in observational studies or randomized trials subject to noncompliance, we propose a general instrumental variable approach to learning optimal treatment regimes under endogeneity. Specifically, we establish identification of both value function E [ Y D ( L ) ] for a given regime D and optimal regimes arg max D E [ Y D ( L ) ] with the aid of a binary instrumental variable, when no unmeasured confounding fails to hold. We also construct novel multiply robust classification-based estimators. Furthermore, we propose to identify and estimate optimal treatment regimes among those who would comply to the assigned treatment under a monotonicity assumption. In this latter case, we establish the somewhat surprising result that complier optimal regimes can be consistently estimated without directly collecting compliance information and therefore without the complier average treatment effect itself being identified. Our approach is illustrated via extensive simulation studies and a data application on the effect of child rearing on labor participation.
Collapse
Affiliation(s)
- Yifan Cui
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104
| | - Eric Tchetgen Tchetgen
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104
| |
Collapse
|
18
|
Meng H, Zhao YQ, Fu H, Qiao X. Near-optimal Individualized Treatment Recommendations. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2020; 21:183. [PMID: 34335111 PMCID: PMC8324003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The individualized treatment recommendation (ITR) is an important analytic framework for precision medicine. The goal of ITR is to assign the best treatments to patients based on their individual characteristics. From the machine learning perspective, the solution to the ITR problem can be formulated as a weighted classification problem to maximize the mean benefit from the recommended treatments given patients' characteristics. Several ITR methods have been proposed in both the binary setting and the multicategory setting. In practice, one may prefer a more flexible recommendation that includes multiple treatment options. This motivates us to develop methods to obtain a set of near-optimal individualized treatment recommendations alternative to each other, called alternative individualized treatment recommendations (A-ITR). We propose two methods to estimate the optimal A-ITR within the outcome weighted learning (OWL) framework. Simulation studies and a real data analysis for Type 2 diabetic patients with injectable antidiabetic treatments are conducted to show the usefulness of the proposed A-ITR framework. We also show the consistency of these methods and obtain an upper bound for the risk between the theoretically optimal recommendation and the estimated one. An R package aitr has been developed, found at https://github.com/menghaomiao/aitr.
Collapse
Affiliation(s)
- Haomiao Meng
- Department of Mathematical Sciences, Binghamton University, State University of New York, Binghamton, NY 13902, USA
| | - Ying-Qi Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Haoda Fu
- Eli Lilly and Company, Indianapolis, IN 46285, USA
| | - Xingye Qiao
- Department of Mathematical Sciences, Binghamton University, State University of New York, Binghamton, NY 13902, USA
| |
Collapse
|
19
|
Sugasawa S, Noma H. Estimating individual treatment effects by gradient boosting trees. Stat Med 2019; 38:5146-5159. [DOI: 10.1002/sim.8357] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Revised: 07/13/2019] [Accepted: 08/02/2019] [Indexed: 11/08/2022]
Affiliation(s)
- Shonosuke Sugasawa
- Center for Spatial Information Science The University of Tokyo Kashiwa Japan
- Research Center for Medical and Health Data Science The Institute of Statistical Mathematics Tokyo Japan
| | - Hisashi Noma
- Research Center for Medical and Health Data Science The Institute of Statistical Mathematics Tokyo Japan
- Department of Data Science The Institute of Statistical Mathematics Tokyo Japan
| |
Collapse
|
20
|
Qi Z, Liu D, Fu H, Liu Y. Multi-Armed Angle-Based Direct Learning for Estimating Optimal Individualized Treatment Rules With Various Outcomes. J Am Stat Assoc 2019; 115:678-691. [PMID: 34219848 DOI: 10.1080/01621459.2018.1529597] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Estimating an optimal individualized treatment rule (ITR) based on patients' information is an important problem in precision medicine. An optimal ITR is a decision function that optimizes patients' expected clinical outcomes. Many existing methods in the literature are designed for binary treatment settings with the interest of a continuous outcome. Much less work has been done on estimating optimal ITRs in multiple treatment settings with good interpretations. In this article, we propose angle-based direct learning (AD-learning) to efficiently estimate optimal ITRs with multiple treatments. Our proposed method can be applied to various types of outcomes, such as continuous, survival, or binary outcomes. Moreover, it has an interesting geometric interpretation on the effect of different treatments for each individual patient, which can help doctors and patients make better decisions. Finite sample error bounds have been established to provide a theoretical guarantee for AD-learning. Finally, we demonstrate the superior performance of our method via an extensive simulation study and real data applications. Supplementary materials for this article are available online.
Collapse
Affiliation(s)
- Zhengling Qi
- Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, NC
| | - Dacheng Liu
- Boehringer Ingelheim Pharmaceuticals, Inc., Ridgefield, CT
| | - Haoda Fu
- Eli Lilly and Company, Lilly Corporate Center, Indianapolis, IN
| | - Yufeng Liu
- Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, Carolina Center for Genome Sciences, Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC
| |
Collapse
|
21
|
Abstract
Precision medicine seeks to maximize the quality of healthcare by individualizing the healthcare process to the uniquely evolving health status of each patient. This endeavor spans a broad range of scientific areas including drug discovery, genetics/genomics, health communication, and causal inference all in support of evidence-based, i.e., data-driven, decision making. Precision medicine is formalized as a treatment regime which comprises a sequence of decision rules, one per decision point, which map up-to-date patient information to a recommended action. The potential actions could be the selection of which drug to use, the selection of dose, timing of administration, specific diet or exercise recommendation, or other aspects of treatment or care. Statistics research in precision medicine is broadly focused on methodological development for estimation of and inference for treatment regimes which maximize some cumulative clinical outcome. In this review, we provide an overview of this vibrant area of research and present important and emerging challenges.
Collapse
Affiliation(s)
- Michael R Kosorok
- Department of Biostatistics and Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599, U.S.A.;
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raleight, North Carolina, 27695, U.S.A.;
| |
Collapse
|
22
|
Abstract
We propose a projection pursuit technique in survival analysis for finding lower-dimensional projections that exhibit differentiated survival outcome. This idea is formally introduced as the change-plane Cox model, a non-regular Cox model with a change-plane in the covariate space dividing the population into two subgroups whose hazards are proportional. The proposed technique offers a potential framework for principled subgroup discovery. Estimation of the change-plane is accomplished via likelihood maximization over a data-driven sieve constructed using sliced inverse regression. Consistency of the sieve procedure for the change-plane parameters is established. In simulations the sieve estimator demonstrates better classification performance for subgroup identification than alternatives.
Collapse
Affiliation(s)
- Susan Wei
- Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota 55455, U.S.A.,
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A.,
| |
Collapse
|
23
|
|