1
|
Wang T, Keil AP, Kim S, Wyss R, Htoo PT, Funk MJ, Buse JB, Kosorok MR, Stürmer T. Iterative Causal Forest: A Novel Algorithm for Subgroup Identification. Am J Epidemiol 2024; 193:764-776. [PMID: 37943684 DOI: 10.1093/aje/kwad219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 10/27/2023] [Accepted: 11/01/2023] [Indexed: 11/12/2023] Open
Abstract
Precisely and efficiently identifying subgroups with heterogeneous treatment effects (HTEs) in real-world evidence studies remains a challenge. Based on the causal forest (CF) method, we developed an iterative CF (iCF) algorithm to identify HTEs in subgroups defined by important variables. Our method iteratively grows different depths of the CF with important effect modifiers, performs plurality votes to obtain decision trees (subgroup decisions) for a family of CFs with different depths, and then finds the cross-validated subgroup decision that best predicts the treatment effect as a final subgroup decision. We simulated 12 different scenarios and showed that the iCF outperformed other machine learning methods for interaction/subgroup identification in the majority of scenarios assessed. Using a 20% random sample of fee-for-service Medicare beneficiaries initiating sodium-glucose cotransporter-2 inhibitors or glucagon-like peptide-1 receptor agonists, we implemented the iCF to identify subgroups with HTEs for hospitalized heart failure. Consistent with previous studies suggesting patients with heart failure benefit more from sodium-glucose cotransporter-2 inhibitors, iCF successfully identified such a subpopulation with HTEs and additive interactions. The iCF is a promising method for identifying subgroups with HTEs in real-world data where the potential for unmeasured confounding can be limited by study design.
Collapse
|
2
|
Bornkamp B, Zaoli S, Azzarito M, Martin R, Müller CP, Moloney C, Capestro G, Ohlssen D, Baillie M. Predicting subgroup treatment effects for a new study: Motivations, results and learnings from running a data challenge in a pharmaceutical corporation. Pharm Stat 2024. [PMID: 38326967 DOI: 10.1002/pst.2368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 12/01/2023] [Accepted: 01/21/2024] [Indexed: 02/09/2024]
Abstract
We present the motivation, experience, and learnings from a data challenge conducted at a large pharmaceutical corporation on the topic of subgroup identification. The data challenge aimed at exploring approaches to subgroup identification for future clinical trials. To mimic a realistic setting, participants had access to 4 Phase III clinical trials to derive a subgroup and predict its treatment effect on a future study not accessible to challenge participants. A total of 30 teams registered for the challenge with around 100 participants, primarily from Biostatistics organization. We outline the motivation for running the challenge, the challenge rules, and logistics. Finally, we present the results of the challenge, the participant feedback as well as the learnings. We also present our view on the implications of the results on exploratory analyses related to treatment effect heterogeneity.
Collapse
Affiliation(s)
- Björn Bornkamp
- Global Drug Development, Novartis Pharma AG, Basel, Switzerland
| | - Silvia Zaoli
- Global Drug Development, Novartis Pharma AG, Basel, Switzerland
| | | | - Ruvie Martin
- Global Drug Development, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | | | - Conor Moloney
- Global Drug Development, Novartis Pharma AG, Dublin, Ireland
| | - Giulia Capestro
- Global Drug Development, Novartis Pharma AG, Basel, Switzerland
| | - David Ohlssen
- Global Drug Development, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | - Mark Baillie
- Global Drug Development, Novartis Pharma AG, Basel, Switzerland
| |
Collapse
|
3
|
Ruberg S, Zhang Y, Showalter H, Shen L. A platform for comparing subgroup identification methodologies. Biom J 2024; 66:e2200164. [PMID: 37147787 DOI: 10.1002/bimj.202200164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 02/24/2023] [Accepted: 03/01/2023] [Indexed: 05/07/2023]
Abstract
Since the advent of the phrase "subgroup identification," there has been an explosion of methodologies that seek to identify meaningful subgroups of patients with exceptional response in order to further the realization of personalized medicine. However, to perform fair comparison and understand what methods work best under different clinical trials situations, a common platform is needed for comparative effectiveness of these various approaches. In this paper, we describe a comprehensive project that created an extensive platform for evaluating subgroup identification methods as well as a publicly posted challenge that was used to elicit new approaches. We proposed a common data-generating model for creating virtual clinical trial datasets that contain subgroups of exceptional responders encompassing the many dimensions of the problem or null scenarios in which there are no such subgroups. Furthermore, we created a common scoring system for evaluating performance of purported methods for identifying subgroups. This makes it possible to benchmark methodologies in order to understand what methods work best under different clinical trial situations. The findings from this project produced considerable insights and allow us to make recommendations for how the statistical community can better compare and contrast old and new subgroup identification methodologies.
Collapse
Affiliation(s)
| | - Ying Zhang
- Global Statistical Sciences, Eli Lilly and Company, Indianapolis, Indiana, USA
| | - Hollins Showalter
- Global Statistical Sciences, Eli Lilly and Company, Indianapolis, Indiana, USA
| | - Lei Shen
- Global Statistical Sciences, Eli Lilly and Company, Indianapolis, Indiana, USA
| |
Collapse
|
4
|
Sun S, Sechidis K, Chen Y, Lu J, Ma C, Mirshani A, Ohlssen D, Vandemeulebroecke M, Bornkamp B. Comparing algorithms for characterizing treatment effect heterogeneity in randomized trials. Biom J 2024; 66:e2100337. [PMID: 36437036 DOI: 10.1002/bimj.202100337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 10/04/2022] [Accepted: 10/16/2022] [Indexed: 11/29/2022]
Abstract
The identification and estimation of heterogeneous treatment effects in biomedical clinical trials are challenging, because trials are typically planned to assess the treatment effect in the overall trial population. Nevertheless, the identification of how the treatment effect may vary across subgroups is of major importance for drug development. In this work, we review some existing simulation work and perform a simulation study to evaluate recent methods for identifying and estimating the heterogeneous treatments effects using various metrics and scenarios relevant for drug development. Our focus is not only on a comparison of the methods in general, but on how well these methods perform in simulation scenarios that reflect real clinical trials. We provide the R package benchtm that can be used to simulate synthetic biomarker distributions based on real clinical trial data and to create interpretable scenarios to benchmark methods for identification and estimation of treatment effect heterogeneity.
Collapse
Affiliation(s)
- Sophie Sun
- Advanced Methodology and Data Science, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | | | - Yao Chen
- Advanced Methodology and Data Science, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | - Jiarui Lu
- Advanced Methodology and Data Science, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | - Chong Ma
- Early Development Analytics, Novartis Pharmaceuticals Corporation, Cambridge, Massachusetts, USA
| | - Ardalan Mirshani
- Advanced Methodology and Data Science, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | - David Ohlssen
- Advanced Methodology and Data Science, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | | | - Björn Bornkamp
- Advanced Methodology and Data Science, Novartis Pharma AG, Basel, Switzerland
| |
Collapse
|
5
|
Zhang J, Zhang P, Ma J, Shentu Y. Covariate-adjusted value-guided subgroup identification via boosting. J Biopharm Stat 2023:1-18. [PMID: 37955423 DOI: 10.1080/10543406.2023.2275757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Accepted: 10/22/2023] [Indexed: 11/14/2023]
Abstract
It is widely recognized that treatment effects could differ across subgroups of patients. Subgroup analysis, which assesses such heterogeneity, provides valuable information in developing personalized therapies. There has been extensive research developing novel statistical methods for subgroup identification. The recent contribution is a value-guided subgroup identification method that directly maximizes treatment benefit at the subgroup level for survival outcome, rather than relying on individual treatment effect estimation. In this paper, we first completed this framework by illustrating its application to continuous and binary outcomes. More importantly, we extended the original framework to account for the prognostic effects and named this new method Covariate-Adjusted Value-guided subgroup identification via boosting (CAVboost). The original method directly used the outcome to formulate the value function for subgroup identification. Since the outcome can further be decomposed as prognostic effects and treatment effects, specifying the prognostic effects as the covariates of a model for the outcome can single out the treatment effects and improve the power to detect them across subgroups. Our proposed CAVboost was based on this key idea. It used a covariate-adjusted treatment effect estimator, instead of the outcome itself, to formulate the value function for subgroup identification. CAVboost estimates the treatment effect by using covariates to account for the prognostic effects, which mimics the idea of using covariates in an ANCOVA estimator. We showed that CAVboost could effectively improve the subgroup identification capability for both continuous and binary outcomes.
Collapse
Affiliation(s)
| | - Pingye Zhang
- Gilead Sciences Inc, Foster City, California, USA
| | - Junshui Ma
- Merck & Co. MRL, BARDS, Rahway, New Jersey, USA
| | - Yue Shentu
- Merck & Co. MRL, BARDS, Rahway, New Jersey, USA
| |
Collapse
|
6
|
Simon N. Considerations for identifying the "right" subgroup in adaptive enrichment trials. Clin Trials 2023:17407745231174909. [PMID: 37269222 DOI: 10.1177/17407745231174909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Adaptive Enrichment Trials aim to make efficient use of data in a pivotal trial of a new targeted therapy to both (a) more precisely identify who benefits from that therapy and (b) improve the likelihood of successfully concluding that the drug is effective, while controlling the probability of false positives. There are a number of frameworks for conducting such a trial and decisions that must be made regarding how to identify that target subgroup. Among those decisions, one must choose how aggressively to restrict enrollment criteria based on the accumulating evidence in the trial. In this article, we empirically evaluate the impact of aggressive versus conservative enrollment restrictions on the power of the trial to detect an effect of treatment. We identify that, in some cases, a more aggressive strategy can substantially improve power. This additionally raises an important question regarding label indication: To what degree do we need a formal test of the hypothesis of no treatment effect in the exact population implied by the label indication? We discuss this question and evaluate how our answer for adaptive enrichment trials may relate to the answer implied by current practice for broad eligibility trials.
Collapse
Affiliation(s)
- Noah Simon
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| |
Collapse
|
7
|
Al-Tashi Q, Saad MB, Muneer A, Qureshi R, Mirjalili S, Sheshadri A, Le X, Vokes NI, Zhang J, Wu J. Machine Learning Models for the Identification of Prognostic and Predictive Cancer Biomarkers: A Systematic Review. Int J Mol Sci 2023; 24:7781. [PMID: 37175487 PMCID: PMC10178491 DOI: 10.3390/ijms24097781] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 04/10/2023] [Accepted: 04/19/2023] [Indexed: 05/15/2023] Open
Abstract
The identification of biomarkers plays a crucial role in personalized medicine, both in the clinical and research settings. However, the contrast between predictive and prognostic biomarkers can be challenging due to the overlap between the two. A prognostic biomarker predicts the future outcome of cancer, regardless of treatment, and a predictive biomarker predicts the effectiveness of a therapeutic intervention. Misclassifying a prognostic biomarker as predictive (or vice versa) can have serious financial and personal consequences for patients. To address this issue, various statistical and machine learning approaches have been developed. The aim of this study is to present an in-depth analysis of recent advancements, trends, challenges, and future prospects in biomarker identification. A systematic search was conducted using PubMed to identify relevant studies published between 2017 and 2023. The selected studies were analyzed to better understand the concept of biomarker identification, evaluate machine learning methods, assess the level of research activity, and highlight the application of these methods in cancer research and treatment. Furthermore, existing obstacles and concerns are discussed to identify prospective research areas. We believe that this review will serve as a valuable resource for researchers, providing insights into the methods and approaches used in biomarker discovery and identifying future research opportunities.
Collapse
Affiliation(s)
- Qasem Al-Tashi
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Maliazurina B. Saad
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Amgad Muneer
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Rizwan Qureshi
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Seyedali Mirjalili
- Centre for Artificial Intelligence Research and Optimization, Torrens University Australia, Fortitude Valley, Brisbane, QLD 4006, Australia
- Yonsei Frontier Lab, Yonsei University, Seoul 03722, Republic of Korea
- University Research and Innovation Center, Obuda University, 1034 Budapest, Hungary
| | - Ajay Sheshadri
- Department of Pulmonary Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Xiuning Le
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Natalie I. Vokes
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Jianjun Zhang
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Jia Wu
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| |
Collapse
|
8
|
Yang H, Vu T, Long Q, Calhoun V, Adali T. Identification of Homogeneous Subgroups from Resting-State fMRI Data. Sensors (Basel) 2023; 23:s23063264. [PMID: 36991975 PMCID: PMC10051904 DOI: 10.3390/s23063264] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 03/04/2023] [Accepted: 03/14/2023] [Indexed: 06/12/2023]
Abstract
The identification of homogeneous subgroups of patients with psychiatric disorders can play an important role in achieving personalized medicine and is essential to provide insights for understanding neuropsychological mechanisms of various mental disorders. The functional connectivity profiles obtained from functional magnetic resonance imaging (fMRI) data have been shown to be unique to each individual, similar to fingerprints; however, their use in characterizing psychiatric disorders in a clinically useful way is still being studied. In this work, we propose a framework that makes use of functional activity maps for subgroup identification using the Gershgorin disc theorem. The proposed pipeline is designed to analyze a large-scale multi-subject fMRI dataset with a fully data-driven method, a new constrained independent component analysis algorithm based on entropy bound minimization (c-EBM), followed by an eigenspectrum analysis approach. A set of resting-state network (RSN) templates is generated from an independent dataset and used as constraints for c-EBM. The constraints present a foundation for subgroup identification by establishing a connection across the subjects and aligning subject-wise separate ICA analyses. The proposed pipeline was applied to a dataset comprising 464 psychiatric patients and discovered meaningful subgroups. Subjects within the identified subgroups share similar activation patterns in certain brain areas. The identified subgroups show significant group differences in multiple meaningful brain areas including dorsolateral prefrontal cortex and anterior cingulate cortex. Three sets of cognitive test scores were used to verify the identified subgroups, and most of them showed significant differences across subgroups, which provides further confirmation of the identified subgroups. In summary, this work represents an important step forward in using neuroimaging data to characterize mental disorders.
Collapse
Affiliation(s)
- Hanlu Yang
- Department of Computer Science and Electrical Engineering, University of Maryland Baltimore County, Baltimore, MD 21250, USA
| | - Trung Vu
- Department of Computer Science and Electrical Engineering, University of Maryland Baltimore County, Baltimore, MD 21250, USA
| | - Qunfang Long
- Department of Computer Science and Electrical Engineering, University of Maryland Baltimore County, Baltimore, MD 21250, USA
| | - Vince Calhoun
- Tri-Institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS), Georgia State University, Georgia Institute of Technology, and Emory University, Atlanta, GA 30303, USA
| | - Tülay Adali
- Department of Computer Science and Electrical Engineering, University of Maryland Baltimore County, Baltimore, MD 21250, USA
| |
Collapse
|
9
|
Yuki S, Tanioka K, Yadohisa H. Estimation and visualization of heterogeneous treatment effects for multiple outcomes. Stat Med 2023; 42:693-715. [PMID: 36574770 DOI: 10.1002/sim.9638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 12/12/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022]
Abstract
We consider two-arm comparison in clinical trials. The objective is to identify a population with characteristics that make the treatment effective. Such a population is called a subgroup. This identification can be made by estimating the treatment effect and identifying the interactions between treatments and covariates. For a single outcome, there are several ways available to identify the subgroups. There are also multiple outcomes, but they are difficult to interpret and cannot be applied to outcomes other than continuous values. In this paper, we thus propose a new method that allows for a straightforward interpretation of subgroups and deals with both continuous and binary outcomes. The proposed method introduces latent variables and adds Lasso sparsity constraints to the estimated loadings to facilitate the interpretation of the relationship between outcomes and covariates. The interpretation of the subgroups is made by visualizing treatment effects and latent variables. Since we are performing sparse estimation, we can interpret the covariates related to the treatment effects and subgroups. Finally, simulation and real data examples demonstrate the effectiveness of the proposed method.
Collapse
Affiliation(s)
- Shintaro Yuki
- Graduate School of Culture and Information Science, Doshisha University, Kyoto, Japan
| | - Kensuke Tanioka
- Department of Biomedical Sciences and Informatics, Doshisha University, Kyoto, Japan
| | - Hiroshi Yadohisa
- Department of Culture and Information Science, Doshisha University, Kyoto, Japan
| |
Collapse
|
10
|
Liu P, Li J, Kosorok MR. Change plane model averaging for subgroup identification. Stat Methods Med Res 2023; 32:773-788. [PMID: 36775991 DOI: 10.1177/09622802231154327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/14/2023]
Abstract
Central to personalized medicine and tailored therapies is discovering the subpopulations that account for treatment effect heterogeneity and are likely to benefit more from given interventions. In this article, we introduce a change plane model averaging method to identify subgroups characterized by linear combinations of predictive variables and multiple cut-offs. We first fit a sequence of statistical models, each incorporating the thresholding effect of one particular covariate. The estimation of submodels is accomplished through an iterative integration of a change point detection method and numerical optimization algorithms. A frequentist model averaging approach is then employed to linearly combine the submodels with optimal weights. Our approach can deal with high-dimensional settings involving enormous potential grouping variables by adopting the sparsity-inducing penalties. Simulation studies are conducted to investigate the prediction and subgrouping performance of the proposed method, with a comparison to various competing subgroup detection methods. Our method is applied to a dataset from a warfarin pharmacogenetics study, producing some new findings.
Collapse
Affiliation(s)
- Pan Liu
- Department of Statistics and Data Science, 37580National University of Singapore, Singapore, Singapore
| | - Jialiang Li
- Department of Statistics and Data Science, 37580National University of Singapore, Singapore, Singapore.,Duke University NUS Graduate Medical School, Singapore, Singapore
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, USA
| |
Collapse
|
11
|
Wang B, Li J, Wang X. Multi-threshold proportional hazards model and subgroup identification. Stat Med 2022; 41:5715-5737. [PMID: 36198478 DOI: 10.1002/sim.9589] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 07/22/2022] [Accepted: 09/19/2022] [Indexed: 11/09/2022]
Abstract
We propose a novel two-stage procedure for change point detection and parameter estimation in a multi-threshold proportional hazards model. In the first stage, we estimate the number of thresholds by formulating the threshold detection problem as a variable selection problem and applying the penalized partial likelihood approach. In the second stage, the change point locations are refined by a grid search and the standard inference for segment regression can then follow. The proposed model and estimation procedure could lend support to subgroup identification and personalized treatment recommendation in medical research. We establish the consistency of the threshold estimators and regression coefficient estimators under technical conditions. The finite sample performance of the method is demonstrated via simulation studies and two cancer data examples.
Collapse
Affiliation(s)
- Bing Wang
- School of Mathematical Sciences, Dalian University of Technology, Dalian, Liaoning, China
| | - Jialiang Li
- Department of Statistics and Data Science, National University of Singapore, Singapore.,Duke University NUS Graduate Medical School, National University of Singapore, Singapore
| | - Xiaoguang Wang
- School of Mathematical Sciences, Dalian University of Technology, Dalian, Liaoning, China.,Key Laboratory for Computational Mathematics and Data Intelligence of Liaoning Province, Dalian University of Technology, Dalian, Liaoning, China
| |
Collapse
|
12
|
Johnston SE, Lipkovich I, Dmitrienko A, Zhao YD. A two-stage adaptive clinical trial design with data-driven subgroup identification at interim analysis. Pharm Stat 2022; 21:1090-1108. [PMID: 35322520 PMCID: PMC10429034 DOI: 10.1002/pst.2208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Revised: 02/14/2022] [Accepted: 03/05/2022] [Indexed: 11/08/2022]
Abstract
In this paper, we consider randomized controlled clinical trials comparing two treatments in efficacy assessment using a time to event outcome. We assume a relatively small number of candidate biomarkers available in the beginning of the trial, which may help define an efficacy subgroup which shows differential treatment effect. The efficacy subgroup is to be defined by one or two biomarkers and cut-offs that are unknown to the investigator and must be learned from the data. We propose a two-stage adaptive design with a pre-planned interim analysis and a final analysis. At the interim, several subgroup-finding algorithms are evaluated to search for a subgroup with enhanced survival for treated versus placebo. Conditional powers computed based on the subgroup and the overall population are used to make decision at the interim to terminate the study for futility, continue the study as planned, or conduct sample size recalculation for the subgroup or the overall population. At the final analysis, combination tests together with closed testing procedures are used to determine efficacy in the subgroup or the overall population. We conducted simulation studies to compare our proposed procedures with several subgroup-identification methods in terms of a novel utility function and several other measures. This research demonstrated the benefit of incorporating data-driven subgroup selection into adaptive clinical trial designs.
Collapse
Affiliation(s)
- Sarah E. Johnston
- Global Biostatistics and Data Science, Bristol Myers Squibb, Berkeley Heights, New Jersey, USA
| | | | | | - Yan Daniel Zhao
- Department of Biostatistics and Epidemiology, University of Oklahoma Health Sciences Center, Oklahoma City, Oklahoma, USA
| |
Collapse
|
13
|
Wolf JM, Koopmeiners JS, Vock DM. A permutation procedure to detect heterogeneous treatments effects in randomized clinical trials while controlling the type I error rate. Clin Trials 2022; 19:512-521. [PMID: 35531765 DOI: 10.1177/17407745221095855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND/AIMS Secondary analyses of randomized clinical trials often seek to identify subgroups with differential treatment effects. These discoveries can help guide individual treatment decisions based on patient characteristics and identify populations for which additional treatments are needed. Traditional analyses require researchers to pre-specify potential subgroups to reduce the risk of reporting spurious results. There is a need for methods that can detect such subgroups without a priori specification while allowing researchers to control the probability of falsely detecting heterogeneous subgroups when treatment effects are uniform across the study population. METHODS We propose a permutation procedure for tuning parameter selection that allows for type I error control when testing for heterogeneous treatment effects framed within the Virtual Twins procedure for subgroup identification. We verify that the type I error rate can be controlled at the nominal rate and investigate the power for detecting heterogeneous effects when present through extensive simulation studies. We apply our method to a secondary analysis of data from a randomized trial of very low nicotine content cigarettes. RESULTS In the absence of type I error control, the observed type I error rate for Virtual Twins was between 99% and 100%. In contrast, models tuned via the proposed permutation were able to control the type I error rate and detect heterogeneous effects when present. An application of our approach to a recently completed trial of very low nicotine content cigarettes identified several variables with potentially heterogeneous treatment effects. CONCLUSIONS The proposed permutation procedure allows researchers to engage in secondary analyses of clinical trials for treatment effect heterogeneity while maintaining the type I error rate without pre-specifying subgroups.
Collapse
Affiliation(s)
- Jack M Wolf
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Joseph S Koopmeiners
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - David M Vock
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
14
|
Sun Y, Luo Z, Fan X. Robust structured heterogeneity analysis approach for high-dimensional data. Stat Med 2022; 41:3229-3259. [PMID: 35460280 DOI: 10.1002/sim.9414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 02/07/2022] [Accepted: 04/05/2022] [Indexed: 11/12/2022]
Abstract
Revealing relationships between genes and disease phenotypes is a critical problem in biomedical studies. This problem has been challenged by the heterogeneity of diseases. Patients of a perceived same disease may form multiple subgroups, and different subgroups have distinct sets of important genes. It is hence imperative to discover the latent subgroups and reveal the subgroup-specific important genes. Some heterogeneity analysis methods have been proposed in the recent literature. Despite considerable successes, most of the existing studies are still limited as they cannot accommodate data contamination and ignore the interconnections among genes. Aiming at these shortages, we develop a robust structured heterogeneity analysis approach to identify subgroups, select important genes as well as estimate their effects on the phenotype of interest. Possible data contamination is accommodated by employing the Huber loss function. A sparse overlapping group lasso penalty is imposed to conduct regularization estimation and gene identification, while taking into account the possibly overlapping cluster structure of genes. This approach takes an iterative strategy in the similar spirit of K-means clustering. Simulations demonstrate that the proposed approach outperforms alternatives in revealing the heterogeneity and selecting important genes for each subgroup. The analysis of Cancer Cell Line Encyclopedia data leads to biologically meaningful findings with improved prediction and grouping stability.
Collapse
Affiliation(s)
- Yifan Sun
- Center for Applied Statistics, Renmin University of China, Beijing, China.,School of Statistics, Renmin University of China, Beijing, China
| | - Ziye Luo
- School of Statistics, Renmin University of China, Beijing, China
| | - Xinyan Fan
- Center for Applied Statistics, Renmin University of China, Beijing, China.,School of Statistics, Renmin University of China, Beijing, China
| |
Collapse
|
15
|
Fang Y, Zhang B, Zhang M. Robust method for optimal treatment decision making based on survival data. Stat Med 2021; 40:6558-6576. [PMID: 34549828 DOI: 10.1002/sim.9198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 06/11/2021] [Accepted: 08/28/2021] [Indexed: 11/06/2022]
Abstract
Identifying the optimal treatment decision rule, where the best treatment for an individual varies according to his/her characteristics, is of great importance when treatment effect heterogeneity exists. We develop methods for estimating the optimal treatment decision rule based on data with survival time as the primary endpoint. Our methods are based on a flexible semiparametric accelerated failure time model, where only the treatment contrast (ie, the difference in means between treatments) is parameterized and all other aspects are unspecified. An individual's treatment contrast is firstly estimated robustly by an augmented inverse probability weighted estimator (AIPWE). Then the optimal decision rule is estimated by minimizing the loss between the treatment contrast and the AIPWE contrast. Two loss functions with different strategies to account for censoring are proposed. The proposed loss functions distinguish from existing ones in that they are based on treatment contrasts, which completely determine the optimal treatment rule. Our methods can further incorporate a penalty term to select variables that are only important for treatment decision making, while taking advantage of all covariates predictive of outcomes to improve performance. Comprehensive simulation studies have been conducted to evaluate performances of the proposed methods relative to existing methods. The proposed methods are illustrated with an application to the ACTG 175 clinical trial on HIV-infected patients.
Collapse
Affiliation(s)
- Yuexin Fang
- Department of Mathematics, Shanghai Normal University, Shanghai, P.R. China
| | - Baqun Zhang
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, P.R. China
| | - Min Zhang
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
16
|
Wei Y, Hsu JC, Chen W, Chew EY, Ding Y. Identification and inference for subgroups with differential treatment efficacy from randomized controlled trials with survival outcomes through multiple testing. Stat Med 2021; 40:6523-6540. [PMID: 34542190 DOI: 10.1002/sim.9196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 08/26/2021] [Accepted: 08/27/2021] [Indexed: 11/08/2022]
Abstract
With the uptake of targeted therapies, instead of the "one-fits-all" approach, modern randomized controlled trials (RCTs) often aim to develop treatments that target a subgroup of patients. Motivated by analyzing the Age-Related Eye Disease Study (AREDS) data, a large RCT to study the efficacy of nutritional supplements in delaying the progression of an eye disease, age-related macular degeneration (AMD), we develop a simultaneous inference procedure to identify and infer subgroups with differential treatment efficacy in RCTs with time-to-event outcomes. Specifically, we formulate the multiple testing problem through contrasts and construct their simultaneous confidence intervals, which appropriately control both within- and across-marker multiplicity. Realistic simulations are conducted using real genotype data to evaluate the method performance under various scenarios. The method is then applied to AREDS to assess the efficacy of antioxidants and zinc combination in delaying AMD progression. Multiple gene regions including ESRRB-VASH1 on chromosome 14 have been identified with subgroups showing differential efficacy. We further validate our findings in an independent subsequent RCT, AREDS2, by discovering consistent differential treatment responses in the targeted and non-targeted subgroups identified from AREDS. This multiple-testing-based simultaneous inference approach provides a step forward to confidently identify and infer subgroups in modern drug development.
Collapse
Affiliation(s)
- Yue Wei
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Jason C Hsu
- Department of Statistics, The Ohio State University, Columbus, Ohio, USA
| | - Wei Chen
- Department of Pediatrics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Emily Y Chew
- National Eye Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Ying Ding
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
17
|
Strahler J, Wachten H, Stark R, Walter B. Alike and different: Associations between orthorexic eating behaviors and exercise addiction. Int J Eat Disord 2021; 54:1415-1425. [PMID: 33955559 DOI: 10.1002/eat.23525] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Revised: 04/09/2021] [Accepted: 04/10/2021] [Indexed: 01/09/2023]
Abstract
OBJECTIVE Symptoms of exercise addiction, a state of compulsively engaging in intense exercise, and orthorexic eating attitudes, the obsession with eating only healthy foods, often occur together. It is assumed that some more general psychological traits underlie this association. Main aim of this report was to examine similarities and differences between orthorexic eating and addictive exercising. METHOD Six hundred and eight individuals completed an online survey (mean age: 27.5, SD = 11.0 years; 76.5% women) measuring exercise addiction (Exercise Addiction inventory, EAI), orthorexic eating (Düsseldorfer Orthorexie Skala, DOS), personality domains (Big-Five Inventory-10), anxiety and depression (Hospital Anxiety and Depression Scale). RESULTS Correlations between the DOS and EAI were .43 in women and .62 in men. Structural equation models identified gender-specific as well as behavior-specific psychological correlates. Among women, anxiety correlated with both EAI and DOS. In addition, the DOS correlated with depression and neuroticism while the EAI correlated with conscientiousness. In men, both scales were associated with conscientiousness and the EAI also correlated with extraversion. Clusterability analysis provided no evidence for clusters based on DOS and EAI. DISCUSSION Present results showed a substantial correlation between addictive exercising and orthorexic eating, however, coefficients were smaller than expected and appeared higher in men. Both behaviors shared few psychological traits (anxiety in women, conscientiousness in men) thereby questioning the assumption of a similar origin. Additionally, gender-specific psychological correlates point to the need for different disease management approaches in women and men.
Collapse
Affiliation(s)
- Jana Strahler
- Department of Psychotherapy and Systems Neuroscience, Faculty of Psychology and Sport Science, University of Giessen, Giessen, Germany
| | - Hanna Wachten
- Department of Psychotherapy and Systems Neuroscience, Faculty of Psychology and Sport Science, University of Giessen, Giessen, Germany
| | - Rudolf Stark
- Department of Psychotherapy and Systems Neuroscience, Faculty of Psychology and Sport Science, University of Giessen, Giessen, Germany.,Center for Mind, Brain and Behavior, Philipps University Marburg and Justus Liebig University Giessen, Germany
| | - Bertram Walter
- Department of Psychotherapy and Systems Neuroscience, Faculty of Psychology and Sport Science, University of Giessen, Giessen, Germany.,Center for Mind, Brain and Behavior, Philipps University Marburg and Justus Liebig University Giessen, Germany
| |
Collapse
|
18
|
Talisa VB, Chang CCH. Learning and confirming a class of treatment responders in clinical trials. Stat Med 2021; 40:4872-4889. [PMID: 34121214 DOI: 10.1002/sim.9100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 05/08/2021] [Accepted: 05/27/2021] [Indexed: 11/09/2022]
Abstract
Clinical trials require substantial effort and time to complete, and regulatory agencies may require two successful efficacy trials before approving a new drug. One way to improve the chance of follow-up success is to identify a subpopulation among whom treatment effects are estimated to be beneficial, and enrolling future studies from this subpopulation. In this article we study confirmable responder class (CRC) learning, where the objective is to learn in a random half of the dataset (training set) a subpopulation among whom the predicted conditional ATE (CATE) suggests clinically meaningful benefit, with maximum power of being confirmed via hypothesis test in the other half (test set). We studied a set of CRC learners across simulated datasets in which either all patients benefited, or only some did. Performance metrics included the rate of confirmation in the test set, and the classification accuracy of the CRC compared with the group with true CATE>0. In trials where all patients benefitted, only two learners were able to consistently identify most of the population as the CRC. One of these also performed especially well when only some patients benefitted, having relatively high confirmation rates, and showing robustness to the dimension of the covariate vector and population characteristics. This learner is based on cross-validation, and is a possible avenue for further development of model selection criteria for CRC learning. We also show that the performance of all methods can be improved by using both halves of the original dataset as training and test sets in turn.
Collapse
Affiliation(s)
- Victor B Talisa
- Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.,Clinical Research, Investigation, and Systems Modeling of Acute illness (CRISMA) Center in the Department of Critical Care Medicine, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
| | - Chung-Chou H Chang
- Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.,Clinical Research, Investigation, and Systems Modeling of Acute illness (CRISMA) Center in the Department of Critical Care Medicine, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA.,Department of Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
19
|
Li J, Li Y, Jin B, Kosorok MR. Multithreshold change plane model: Estimation theory and applications in subgroup identification. Stat Med 2021; 40:3440-3459. [PMID: 33843100 DOI: 10.1002/sim.8976] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 01/06/2021] [Accepted: 03/21/2021] [Indexed: 11/05/2022]
Abstract
We propose a multithreshold change plane regression model which naturally partitions the observed subjects into subgroups with different covariate effects. The underlying grouping variable is a linear function of observed covariates and thus multiple thresholds produce change planes in the covariate space. We contribute a novel two-stage estimation approach to determine the number of subgroups, the location of thresholds, and all other regression parameters. In the first stage we adopt a group selection principle to consistently identify the number of subgroups, while in the second stage change point locations and model parameter estimates are refined by a penalized induced smoothing technique. Our procedure allows sparse solutions for relatively moderate- or high-dimensional covariates. We further establish the asymptotic properties of our proposed estimators under appropriate technical conditions. We evaluate the performance of the proposed methods by simulation studies and provide illustrations using two medical data examples. Our proposal for subgroup identification may lead to an immediate application in personalized medicine.
Collapse
Affiliation(s)
- Jialiang Li
- Department of Statistics and Applied Probability, National University of Singapore, Singapore, Singapore.,Duke-NUS Graduate Medical School, National University of Singapore, Singapore, Singapore.,Singapore Eye Research Institute, Singapore, Singapore
| | - Yaguang Li
- International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui, China
| | - Baisuo Jin
- International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui, China
| | - Michael R Kosorok
- Department of Biotatistics, University of North Carolina, Chapel Hill, North Carolina, USA
| |
Collapse
|
20
|
Kang D, S Coffey C, J Smith B, Yuan Y, Shi Q, Yin J. Hierarchical Bayesian clustering design of multiple biomarker subgroups (HCOMBS). Stat Med 2021; 40:2893-2921. [PMID: 33772843 DOI: 10.1002/sim.8946] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 12/21/2020] [Accepted: 02/20/2021] [Indexed: 12/14/2022]
Abstract
Given the Food and Drug Administration's (FDA's) acceptance of master protocol designs in recent guidance documents, the oncology field is rapidly moving to address the paradigm shift to molecular subtype focused studies. Identifying new "marker-based" treatments requires new methodologies to address the growing demand to conduct clinical trials in smaller molecular subpopulations, identify effective treatment and marker interactions, and control for false positives. We introduce our methodology, Hierarchical Bayesian Clustering Design of Multiple Biomarker Subgroups (HCOMBS), a two-stage umbrella Phase II design with effect size clustering and information borrowing across multiple biomarker-treatment pairs. HCOMBS was designed to reduce required sample size, differentiate between varying effect sizes, and control for operating characteristics in the multi-arm setting. When compared to independently applied Simon's Optimal two-stage design, we showed through simulations that HCOMBS required less participants per treatment arm with a well-controlled family-wise error rate and desirable marginal power. Additionally, HCOMBS features a statistical approach that simultaneously conducts clustering and hypothesis testing in one step. We also applied the proposed design on the alliance brain metastases umbrella trial.
Collapse
Affiliation(s)
- Daniel Kang
- Department of Biostatistics, College of Public Health, University of Iowa, Iowa City, Iowa, USA
| | - Christopher S Coffey
- Department of Biostatistics, College of Public Health, University of Iowa, Iowa City, Iowa, USA
| | - Brian J Smith
- Department of Biostatistics, College of Public Health, University of Iowa, Iowa City, Iowa, USA
| | - Ying Yuan
- Department of Biostatistics, MD Anderson Cancer Center, Houston, Texas, USA
| | - Qian Shi
- Division of Clinical Trials and Biostatistics, Mayo Clinic, Rochester, MN, USA
| | - Jun Yin
- Division of Clinical Trials and Biostatistics, Mayo Clinic, Rochester, MN, USA
| |
Collapse
|
21
|
韦 红, 康 佩, 刘 颖, 黄 福, 陈 征, 安 胜. [ Subgroup identification based on accelerated failure time model combined with adaptive elastic net]. Nan Fang Yi Ke Da Xue Xue Bao 2021; 41:391-398. [PMID: 33849830 PMCID: PMC8075779 DOI: 10.12122/j.issn.1673-4254.2021.03.11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Indexed: 11/24/2022]
Abstract
OBJECTIVE To solve the problem of identifying subgroup in a randomized clinical trial with respect to survival time, we present a strategy based on accelerated failure time model to identify the subgroup with an enhanced treatment effect. OBJECTIVE We fitted and compared univariate accelerated failure time (AFT) models and penalized AFT models regularized by adaptive elastic net to identify the candidate covariates. Based on these covariates, we utilized change-point algorithm to classify the patient subgroups. A two-stage adaptive design was adopted to verify the treatment effect in certain subgroups. Simulations were conducted across different scenarios to evaluate the performance of the models. OBJECTIVE As the correlation between covariates differed, the power of the models with an adaptive design was stable. In the two-stage adaptive design, the power of the models was the highest when the two significance levels (α1 and α2) were allocated to be 0.035 and 0.015, respectively. A better effect of the responder subgroup was associated with a higher power of the models. For a fixed sample size, the power decreased as the covariate number to sample size ratio increased, but the power showed a stable trend when the ratio was above 1. The univariate models showed different distribution patterns of the parameters for different survival time, while their distribution was relatively stable in the penalized AFT models. OBJECTIVE The correlation between the covariates does not affect the performance of univariate AFT models and penalized AFT models. (0.035, 0.015) can be used as a reference for the significance level of an adaptive design. For small differences in the treatment effect between the responder and the non-responder, the penalized AFT model including the main effect of covariate (Penalized, Eq_in) outperforms the univariate AFT model excluding the main effect of covariate (Univariate, Eq_ex). Univariate, Eq_ex performs better when the covariate number to sample size ratio is less than 1, but is outperformed by Penalized, Eq_in when the ratio is above 1. The parameter distribution of survival time has a greater impact on the univariate models but a smaller impact on the penalized models.
Collapse
Affiliation(s)
- 红霞 韦
- />南方医科大学公共卫生学院生物统计学系,广东 广州 510515Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
| | - 佩 康
- />南方医科大学公共卫生学院生物统计学系,广东 广州 510515Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
| | - 颖欣 刘
- />南方医科大学公共卫生学院生物统计学系,广东 广州 510515Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
| | - 福强 黄
- />南方医科大学公共卫生学院生物统计学系,广东 广州 510515Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
| | - 征 陈
- />南方医科大学公共卫生学院生物统计学系,广东 广州 510515Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
| | - 胜利 安
- />南方医科大学公共卫生学院生物统计学系,广东 广州 510515Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
| |
Collapse
|
22
|
Wei Y, Wang X, Chew EY, Ding Y. Confident identification of subgroups from SNP testing in RCTs with binary outcomes. Biom J 2021; 64:256-271. [PMID: 33751636 DOI: 10.1002/bimj.202000170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Revised: 10/20/2020] [Accepted: 12/03/2020] [Indexed: 11/11/2022]
Abstract
In modern drug development, genotype information becomes more frequently collected in randomized controlled trials (RCTs) for individualized risk prediction and personalized medicine development. Finding single nucleotide polymorphisms (SNPs) that are predictive of differential treatment efficacy, measured by a clinical outcome, is fundamentally different and more challenging than the traditional association test for a quantitative trait. With the objective to confidently identify and infer genetic subgroups with enhanced treatment efficacy from a large RCT for an eye disease, age-related macular degeneration (AMD), where the clinical endpoint is binary (progressed or not), we propose a novel SNP-testing procedure for binary clinical outcomes. Specifically, we formulate four contrasts to simultaneously assess all possible genetic effects on a logic-respecting efficacy measure, the relative risk (between treatment and control). Our method controls both within- and across-SNP multiplicity rigorously. We then use real genotype data to perform chromosome-wide simulations to evaluate our method performance and to provide practical recommendations. Finally, we apply the proposed method to perform a genome-wide SNP testing for the AMD trial and successfully identify multiple gene regions with genetic subgroups exhibiting enhanced efficacy in terms of decreasing the AMD progression rate.
Collapse
Affiliation(s)
- Yue Wei
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Xinjun Wang
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Emily Y Chew
- National Eye Institute, NIH, Bethesda, Maryland, USA
| | - Ying Ding
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
23
|
Ruberg SJ. Assessing and communicating heterogeneity of treatment effects for patient subpopulations: The hardest problem there is. Pharm Stat 2021; 20:939-944. [PMID: 33655601 DOI: 10.1002/pst.2110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Revised: 11/24/2020] [Accepted: 02/15/2021] [Indexed: 11/09/2022]
Abstract
Heterogeneity is an enormously complex problem because there are so many dimensions and variables that can be considered when assessing which ones may influence an efficacy or safety outcome for an individual patient. This is difficult in randomized controlled trials and even more so in observational settings. An alternative approach is presented in which the individual patient becomes the "subgroup," and similar patients are identified in the clinical trial database or electronic medical record that can be used to predict how that individual patient may respond to treatment.
Collapse
|
24
|
Tiede KE, Schultheis SK, Meyer B. Subgroup Splits in Diverse Work Teams: Subgroup Perceptions but Not Demographic Faultlines Affect Team Identification and Emotional Exhaustion. Front Psychol 2021; 12:595720. [PMID: 33643128 PMCID: PMC7907170 DOI: 10.3389/fpsyg.2021.595720] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Accepted: 01/15/2021] [Indexed: 11/20/2022] Open
Abstract
We investigate the relationship between (hypothetical) subgroup splits (i.e., faultlines), subjectively perceived subgroups, and team identification and emotional exhaustion. Based on the job demands-resources model and on self-categorization theory, we propose that faultline strength and perceived subgroups negatively affect emotional exhaustion, mediated by team identification. We further propose that subgroup identification moderates the mediation such that subgroup identification compensates low levels of team identification. We tested our hypotheses with a two-wave questionnaire study in a sample of 105 participants from 48 teams from various contexts. We found an effect of perceived subgroups on emotional exhaustion mediated by team identification, but no direct or indirect effect of faultline strength on emotional exhaustion. We also could not find that subgroup identification moderates the effect of team identification on emotional exhaustion. We discuss the need for further research on the link of subgroup splits in work teams and the rise of psychological health issues and derive that measures to prevent burnout should primarily focus on avoiding or reducing subgroup perception whereas affecting the actual demographic composition of work team should be of lower priority.
Collapse
Affiliation(s)
- Kevin E. Tiede
- Graduate School of Decision Sciences and Department of Psychology, University of Konstanz, Konstanz, Germany
| | | | - Bertolt Meyer
- Department of Psychology, Chemnitz University of Technology, Chemnitz, Germany
| |
Collapse
|
25
|
Zhang P, Ma J, Chen X, Shentu Y. A nonparametric method for value function guided subgroup identification via gradient tree boosting for censored survival data. Stat Med 2020; 39:4133-4146. [PMID: 32786155 DOI: 10.1002/sim.8714] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2020] [Revised: 06/08/2020] [Accepted: 07/09/2020] [Indexed: 11/07/2022]
Abstract
In randomized clinical trials with survival outcome, there has been an increasing interest in subgroup identification based on baseline genomic, proteomic markers, or clinical characteristics. Some of the existing methods identify subgroups that benefit substantially from the experimental treatment by directly modeling outcomes or treatment effect. When the goal is to find an optimal treatment for a given patient rather than finding the right patient for a given treatment, methods under the individualized treatment regime framework estimate an individualized treatment rule that would lead to the best expected clinical outcome as measured by a value function. Connecting the concept of value function to subgroup identification, we propose a nonparametric method that searches for subgroup membership scores by maximizing a value function that directly reflects the subgroup-treatment interaction effect based on restricted mean survival time. A gradient tree boosting algorithm is proposed to search for the individual subgroup membership scores. We conduct simulation studies to evaluate the performance of the proposed method and an application to an AIDS clinical trial is performed for illustration.
Collapse
Affiliation(s)
- Pingye Zhang
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Junshui Ma
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Xinqun Chen
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Yue Shentu
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| |
Collapse
|
26
|
Schnell PM. Monte Carlo approaches to frequentist multiplicity-adjusted benefiting subgroup identification. Stat Methods Med Res 2020; 30:1026-1041. [PMID: 33256562 DOI: 10.1177/0962280220973705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
One common goal of subgroup analyses is to determine the subgroup of the population for which a given treatment is effective. Like most problems in subgroup analyses, this benefiting subgroup identification requires careful attention to multiple testing considerations, especially Type I error inflation. To partially address these concerns, the credible subgroups approach provides a pair of bounding subgroups for the benefiting subgroup, constructed so that with high posterior probability one is contained by the benefiting subgroup while the other contains the benefiting subgroup. To date, this approach has been presented within the Bayesian paradigm only, and requires sampling from the posterior of a Bayesian model. Additionally, in many cases, such as regulatory submission, guarantees of frequentist operating characteristics are helpful or necessary. We present Monte Carlo approaches to constructing confidence subgroups, frequentist analogues to credible subgroups that replace the posterior distribution with an estimate of the joint distribution of personalized treatment effect estimates, and yield frequentist interpretations and coverage guarantees. The estimated joint distribution is produced using either draws from asymptotic sampling distributions of estimated model parameters, or bootstrap resampling schemes. The approach is applied to a publicly available dataset from randomized trials of Alzheimer's disease treatments.
Collapse
Affiliation(s)
- Patrick M Schnell
- Division of Biostatistics, The 2647Ohio State University College of Public Health, Ohio, USA
| |
Collapse
|
27
|
Abstract
The mixture cure model has been widely applied to survival data in which a fraction of the observations never experience the event of interest, despite long-term follow-up. In this paper, we study the Cox proportional hazards mixture cure model where the covariate effects on the distribution of uncured subjects' failure time may jump when a covariate exceeds a change point. The nonparametric maximum likelihood estimation is used to obtain the semiparametric estimates. We employ a two-step computational procedure involving the Expectation-Maximization algorithm to implement the estimation. The consistency, convergence rate and asymptotic distributions of the estimators are carefully established under technical conditions and we show that the change point estimator is n consistency. The m out of n bootstrap and the Louis algorithm are used to obtain the standard errors of the estimated change point and other regression parameter estimates, respectively. We also contribute a test procedure to check the existence of the change point. The finite sample performance of the proposed method is demonstrated via simulation studies and real data examples.
Collapse
Affiliation(s)
- Bing Wang
- School of Mathematical Sciences, Dalian University of Technology, China
| | - Jialiang Li
- Department of Statistics and Applied Probability, Duke University NUS Graduate Medical School, Singapore Eye Research Institute, National University of Singapore, Singapore, Singapore
| | - Xiaoguang Wang
- School of Mathematical Sciences, Dalian University of Technology, China
| |
Collapse
|
28
|
Abstract
In recent years, subgroup analysis has emerged as an important tool to identify unknown subgroup memberships. However, subgroup analysis is still under-studied for longitudinal data. In this paper, we propose a structured mixed-effects approach for longitudinal data to model subgroup distribution and identify subgroup membership simultaneously. In the proposed structured mixed-effects model, the heterogeneous treatment effect is modeled as a random effect from a two-component mixture model, while the membership of the mixture model is incorporated using a logistic model with respect to some covariates. One advantage of our approach is that we are able to derive the estimation of the treatment effects through an EM-type algorithm which keeps the subgroup membership unchanged over time. Our numerical studies and real data example demonstrate that the proposed model outperforms other competing methods.
Collapse
Affiliation(s)
- Juan Shen
- Department of Statistics, Fudan University , Shanghai, China
| | - Annie Qu
- Department of Statistics, University of California at Irvine , Irvine, California, USA
| |
Collapse
|
29
|
Wang J, Li J, Li Y, Wong WK. A model-based multithreshold method for subgroup identification. Stat Med 2019; 38:2605-2631. [PMID: 30887552 DOI: 10.1002/sim.8136] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Revised: 01/29/2019] [Accepted: 02/11/2019] [Indexed: 11/07/2022]
Abstract
Thresholding variable plays a crucial role in subgroup identification for personalized medicine. Most existing partitioning methods split the sample based on one predictor variable. In this paper, we consider setting the splitting rule from a combination of multivariate predictors, such as the latent factors, principle components, and weighted sum of predictors. Such a subgrouping method may lead to more meaningful partitioning of the population than using a single variable. In addition, our method is based on a change point regression model and thus yields straight forward model-based prediction results. After choosing a particular thresholding variable form, we apply a two-stage multiple change point detection method to determine the subgroups and estimate the regression parameters. We show that our approach can produce two or more subgroups from the multiple change points and identify the true grouping with high probability. In addition, our estimation results enjoy oracle properties. We design a simulation study to compare performances of our proposed and existing methods and apply them to analyze data sets from a Scleroderma trial and a breast cancer study.
Collapse
Affiliation(s)
- Jingli Wang
- Department of Statistics and Applied Probability, National University of Singapore, Singapore
| | - Jialiang Li
- Department of Statistics and Applied Probability, National University of Singapore, Singapore.,Duke University-NUS Graduate Medical School, Singapore.,Singapore Eye Research Institute, Singapore
| | - Yaguang Li
- University of Science and Technology of China, Hefei, China
| | - Weng Kee Wong
- Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, Los Angeles, California
| |
Collapse
|
30
|
张 燕, 李 雪, 王 志, 安 胜. [ Subgroup identification based on the Logistic model]. Nan Fang Yi Ke Da Xue Xue Bao 2018; 38:1503-1508. [PMID: 30613021 PMCID: PMC6744210 DOI: 10.12122/j.issn.1673-4254.2018.12.17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 03/04/2018] [Indexed: 02/05/2023]
Abstract
We propose a subgroup identification method based on the Logistic model for data from a two-arm clinical trial with dichotomous outcome variables.In this method, binary Logistic regression models are established for each group to calculate the outcome probabilities of each patient for comparison.According to the established rules, the patients are classified into their corresponding subgroups to establish a multinomial Logistic regression model.We simulated the false rate, correct judgment rate, coincidence rate and model correct judgment rate for different sample sizes and carried out an example analysis.The results of simulation showed that for different sample sizes, the false rates of this method were below 0.07 and the correct judgment rates were all above 0.75 with adequate coincidence rates and model correct judgment rates, demonstrating the effectiveness and reliability of the proposed method for subgroup identification.
Collapse
Affiliation(s)
- 燕虹 张
- 南方医科大学公共卫生学院生物统计学系,广东 广州 510515Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
- 汕头大学医学院,广东 汕头 515041Shantou University Medical College, Shantou 515041, China
| | - 雪媛 李
- 广州医科大学附属第三医院妇产科,广东 广州 510515Department of Gynecology and Obstetrics, Third Affiliated Hospital of Guangzhou Medical University, Guangzhou 510150, China
| | - 志坚 王
- 南方医院妇产科,广东 广州 510515Department of Gynecology and Obstetrics, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China
| | - 胜利 安
- 南方医科大学公共卫生学院生物统计学系,广东 广州 510515Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
- 安胜利,副教授,博士,电话:020-61648319,E-mail:
| |
Collapse
|
31
|
Abstract
Background A recent focus in the health sciences has been the development of personalized medicine, which includes determining the population for which a given treatment is effective. Due to limited data, identifying the true benefiting population is a challenging task. To tackle this difficulty, the credible subgroups approach provides a pair of bounding subgroups for the true benefiting subgroup, constructed so that one is contained by the benefiting subgroup while the other contains the benefiting subgroup with high probability. However, the method has so far only been developed for parametric linear models. Methods In this article, we develop the details required to follow the credible subgroups approach in more realistic settings by considering nonlinear and semiparametric regression models, supported for regulatory science by conditional power simulations. We also present an improved multiple testing approach using a step-down procedure. We evaluate our approach via simulations and apply it to data from four trials of Alzheimer's disease treatments carried out by AbbVie. Results Semiparametric modeling yields credible subgroups that are more robust to violations of linear treatment effect assumptions, and careful choice of the population of interest as well as the step-down multiple testing procedure result in a higher rate of detection of benefiting types of patients. The approach allows us to identify types of patients that benefit from treatment in the Alzheimer's disease trials. Conclusion Attempts to identify benefiting subgroups of patients in clinical trials are often met with skepticism due to a lack of multiplicity control and unrealistically restrictive assumptions. Our proposed approach merges two techniques, credible subgroups, and semiparametric regression, which avoids these problems and makes benefiting subgroup identification practical and reliable.
Collapse
Affiliation(s)
- Patrick M Schnell
- 1 Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, USA
| | - Peter Müller
- 2 Department of Mathematics, The University of Texas at Austin, Austin, TX, USA
| | - Qi Tang
- 3 Former employee of AbbVie, AbbVie, North Chicago, IL, USA
- 4 Sanofi, Bridgewater, NJ, USA
| | - Bradley P Carlin
- 5 Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
32
|
van den Berge MJC, Free RH, Arnold R, de Kleine E, Hofman R, van Dijk JMC, van Dijk P. Cluster Analysis to Identify Possible Subgroups in Tinnitus Patients. Front Neurol 2017; 8:115. [PMID: 28421030 PMCID: PMC5377919 DOI: 10.3389/fneur.2017.00115] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2016] [Accepted: 03/14/2017] [Indexed: 11/13/2022] Open
Abstract
Introduction In tinnitus treatment, there is a tendency to shift from a “one size fits all” to a more individual, patient-tailored approach. Insight in the heterogeneity of the tinnitus spectrum might improve the management of tinnitus patients in terms of choice of treatment and identification of patients with severe mental distress. The goal of this study was to identify subgroups in a large group of tinnitus patients. Methods Data were collected from patients with severe tinnitus complaints visiting our tertiary referral tinnitus care group at the University Medical Center Groningen. Patient-reported and physician-reported variables were collected during their visit to our clinic. Cluster analyses were used to characterize subgroups. For the selection of the right variables to enter in the cluster analysis, two approaches were used: (1) variable reduction with principle component analysis and (2) variable selection based on expert opinion. Results Various variables of 1,783 tinnitus patients were included in the analyses. Cluster analysis (1) included 976 patients and resulted in a four-cluster solution. The effect of external influences was the most discriminative between the groups, or clusters, of patients. The “silhouette measure” of the cluster outcome was low (0.2), indicating a “no substantial” cluster structure. Cluster analysis (2) included 761 patients and resulted in a three-cluster solution, comparable to the first analysis. Again, a “no substantial” cluster structure was found (0.2). Conclusion Two cluster analyses on a large database of tinnitus patients revealed that clusters of patients are mostly formed by a different response of external influences on their disease. However, both cluster outcomes based on this dataset showed a poor stability, suggesting that our tinnitus population comprises a continuum rather than a number of clearly defined subgroups.
Collapse
Affiliation(s)
- Minke J C van den Berge
- Department of Otorhinolaryngology/Head and Neck Surgery, University of Groningen, University Medical Center Groningen, Groningen, Netherlands.,Graduate School of Medical Sciences (Research School of Behavioural and Cognitive Neurosciences), University of Groningen, Groningen, Netherlands
| | - Rolien H Free
- Department of Otorhinolaryngology/Head and Neck Surgery, University of Groningen, University Medical Center Groningen, Groningen, Netherlands.,Graduate School of Medical Sciences (Research School of Behavioural and Cognitive Neurosciences), University of Groningen, Groningen, Netherlands
| | - Rosemarie Arnold
- Department of Otorhinolaryngology/Head and Neck Surgery, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Emile de Kleine
- Department of Otorhinolaryngology/Head and Neck Surgery, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Rutger Hofman
- Department of Otorhinolaryngology/Head and Neck Surgery, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - J Marc C van Dijk
- Graduate School of Medical Sciences (Research School of Behavioural and Cognitive Neurosciences), University of Groningen, Groningen, Netherlands.,Department of Neurosurgery, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Pim van Dijk
- Department of Otorhinolaryngology/Head and Neck Surgery, University of Groningen, University Medical Center Groningen, Groningen, Netherlands.,Graduate School of Medical Sciences (Research School of Behavioural and Cognitive Neurosciences), University of Groningen, Groningen, Netherlands
| |
Collapse
|
33
|
Huang X, Sun Y, Trow P, Chatterjee S, Chakravartty A, Tian L, Devanarayan V. Patient subgroup identification for clinical drug development. Stat Med 2017; 36:1414-1428. [PMID: 28147447 DOI: 10.1002/sim.7236] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Revised: 11/22/2016] [Accepted: 01/05/2017] [Indexed: 12/26/2022]
Abstract
Causal mechanism of relationship between the clinical outcome (efficacy or safety endpoints) and putative biomarkers, clinical baseline, and related predictors is usually unknown and must be deduced empirically from experimental data. Such relationships enable the development of tailored therapeutics and implementation of a precision medicine strategy in clinical trials to help stratify patients in terms of disease progression, clinical response, treatment differentiation, and so on. These relationships often require complex modeling to develop the prognostic and predictive signatures. For the purpose of easier interpretation and implementation in clinical practice, defining a multivariate biomarker signature in terms of thresholds (cutoffs/cut points) on individual biomarkers is preferable. In this paper, we propose some methods for developing such signatures in the context of continuous, binary and time-to-event endpoints. Results from simulations and case study illustration are also provided. Copyright © 2017 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Xin Huang
- AbbVie, Inc., North Chicago, IL, U.S.A
| | - Yan Sun
- AbbVie, Inc., North Chicago, IL, U.S.A
| | - Paul Trow
- AbbVie, Inc., North Chicago, IL, U.S.A
| | | | | | - Lu Tian
- Stanford University School of Medicine, Palo Alto, CA, U.S.A
| | | |
Collapse
|
34
|
Abstract
This article focuses on a broad class of statistical and clinical considerations related to the assessment of treatment effects across patient subgroups in late-stage clinical trials. This article begins with a comprehensive review of clinical trial literature and regulatory guidelines to help define scientifically sound approaches to evaluating subgroup effects in clinical trials. All commonly used types of subgroup analysis are considered in the article, including different variations of prospectively defined and post-hoc subgroup investigations. In the context of confirmatory subgroup analysis, key design and analysis options are presented, which includes conventional and innovative trial designs that support multi-population tailoring approaches. A detailed summary of exploratory subgroup analysis (with the purpose of either consistency assessment or subgroup identification) is also provided. The article promotes a more disciplined approach to post-hoc subgroup identification and formulates key principles that support reliable evaluation of subgroup effects in this setting.
Collapse
Affiliation(s)
- Alex Dmitrienko
- a Center for Statistics in Drug Development, Quintiles , Overland Park , Kansas , USA
| | | | - Arno Fritsch
- c Clinical Statistics , Bayer HealthCare , Wuppertal , Germany
| | - Ilya Lipkovich
- a Center for Statistics in Drug Development, Quintiles , Overland Park , Kansas , USA
| |
Collapse
|
35
|
Abstract
Statistical principles and ongoing proliferation of novel statistical methodologies have dramatically improved the clinical drug development process. This journey over the last seven decades reshaped the pharmaceutical industry and regulatory agencies, highlighted the importance of statistical thinking in drug development and decision-making, and, most importantly, improved the lives of countless patients around the world. Some significant highlights in the history of this journey are recounted here as well as some exciting opportunities of what the future may hold for the science and profession of statistics.
Collapse
Affiliation(s)
- Stephen J Ruberg
- a Lilly Corporate Center, Global Statistical Sciences , Eli Lilly & Company , Indianapolis , Indiana , USA
| |
Collapse
|
36
|
Fu H, Zhou J, Faries DE. Estimating optimal treatment regimes via subgroup identification in randomized control trials and observational studies. Stat Med 2016; 35:3285-302. [PMID: 26892174 DOI: 10.1002/sim.6920] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Revised: 01/15/2016] [Accepted: 02/01/2016] [Indexed: 11/12/2022]
Abstract
With new treatments and novel technology available, personalized medicine has become an important piece in the new era of medical product development. Traditional statistics methods for personalized medicine and subgroup identification primarily focus on single treatment or two arm randomized control trials. Motivated by the recent development of outcome weighted learning framework, we propose an alternative algorithm to search treatment assignments which has a connection with subgroup identification problems. Our method focuses on applications from clinical trials to generate easy to interpret results. This framework is able to handle two or more than two treatments from both randomized control trials and observational studies. We implement our algorithm in C++ and connect it with R. Its performance is evaluated by simulations, and we apply our method to a dataset from a diabetes study. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Haoda Fu
- Eli Lilly and Company, Lilly Corporate Center, Indianapolis, 46285, IN, U.S.A
| | - Jin Zhou
- Biostatistics Department, University of Arizona, Tucson, AZ, 85721, U.S.A
| | - Douglas E Faries
- Eli Lilly and Company, Lilly Corporate Center, Indianapolis, 46285, IN, U.S.A
| |
Collapse
|
37
|
Ondra T, Dmitrienko A, Friede T, Graf A, Miller F, Stallard N, Posch M. Methods for identification and confirmation of targeted subgroups in clinical trials: A systematic review. J Biopharm Stat 2016; 26:99-119. [PMID: 26378339 PMCID: PMC4732423 DOI: 10.1080/10543406.2015.1092034] [Citation(s) in RCA: 73] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 08/14/2015] [Indexed: 12/30/2022]
Abstract
Important objectives in the development of stratified medicines include the identification and confirmation of subgroups of patients with a beneficial treatment effect and a positive benefit-risk balance. We report the results of a literature review on methodological approaches to the design and analysis of clinical trials investigating a potential heterogeneity of treatment effects across subgroups. The identified approaches are classified based on certain characteristics of the proposed trial designs and analysis methods. We distinguish between exploratory and confirmatory subgroup analysis, frequentist, Bayesian and decision-theoretic approaches and, last, fixed-sample, group-sequential, and adaptive designs and illustrate the available trial designs and analysis strategies with published case studies.
Collapse
Affiliation(s)
- Thomas Ondra
- Center for Medical Statistics and Informatics, Medizinische Universität Wien, Vienna, Austria
| | - Alex Dmitrienko
- Center for Statistics in Drug Development, Quintiles, Overland Park, Kansas, USA
| | - Tim Friede
- Department of Medical Statistics, Universitaetsmedizin, Göttingen, Göttingen, Germany
| | - Alexandra Graf
- Center for Medical Statistics and Informatics, Medizinische Universität Wien, Vienna, Austria
| | - Frank Miller
- Statistiska institutionen, Stockholms Universitet, Stockholm, Sweden
| | - Nigel Stallard
- Department of Statistics and Epidemiology, University of Warwick, Coventry, UK
| | - Martin Posch
- Center for Medical Statistics and Informatics, Medizinische Universität Wien, Vienna, Austria
| |
Collapse
|
38
|
Lu TP, Chen JJ. Identification of drug-induced toxicity biomarkers for treatment determination. Pharm Stat 2015; 14:284-93. [PMID: 25914330 DOI: 10.1002/pst.1684] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Revised: 11/18/2014] [Accepted: 03/30/2015] [Indexed: 12/28/2022]
Abstract
Drug-induced organ toxicity (DIOT) that leads to the removal of marketed drugs or termination of candidate drugs has been a leading concern for regulatory agencies and pharmaceutical companies. In safety studies, the genomic assays are conducted after the treatment so that drug-induced adverse effects can occur. Two types of biomarkers are observed: biomarkers of susceptibility and biomarkers of response. This paper presents a statistical model to distinguish two types of biomarkers and procedures to identify susceptible subpopulations. The biomarkers identified are used to develop classification model to identify susceptible subpopulation. Two methods to identify susceptibility biomarkers were evaluated in terms of predictive performance in subpopulation identification, including sensitivity, specificity, and accuracy. Method 1 considered the traditional linear model with a variable-by-treatment interaction term, and Method 2 considered fitting a single predictor variable model using only treatment data. Monte Carlo simulation studies were conducted to evaluate the performance of the two methods and impact of the subpopulation prevalence, probability of DIOT, and sample size on the predictive performance. Method 2 appeared to outperform Method 1, which was due to the lack of power for testing the interaction effect. Important statistical issues and challenges regarding identification of preclinical DIOT biomarkers were discussed. In summary, identification of predictive biomarkers for treatment determination highly depends on the subpopulation prevalence. When the proportion of susceptible subpopulation is 1% or less, a very large sample size is needed to ensure observing sufficient number of DIOT responses for biomarker and/or subpopulation identifications.
Collapse
Affiliation(s)
- Tzu-Pin Lu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, Food and Drug Administration, Jefferson, AR, USA.,Department of Public Health Institute of Epidemiology and Preventive Medicine, National Taiwan University, Taipei, Taiwan
| | - James J Chen
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, Food and Drug Administration, Jefferson, AR, USA
| |
Collapse
|
39
|
Hardin DS, Rohwer RD, Curtis BH, Zagar A, Chen L, Boye KS, Jiang HH, Lipkovich IA. Understanding heterogeneity in response to antidiabetes treatment: a post hoc analysis using SIDES, a subgroup identification algorithm. J Diabetes Sci Technol 2013; 7:420-30. [PMID: 23567001 PMCID: PMC3737644 DOI: 10.1177/193229681300700219] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
BACKGROUND Treatment response in patients with type 2 diabetes mellitus (T2DM) varies because of different genotypic and phenotypic characteristics. Results of clinical trials are based largely on aggregated estimates of treatment effect rather than individualized outcomes. This research assessed heterogeneity and differential treatment response using the subgroup identification based on differential effect search (SIDES) algorithm with data from a large multinational study. METHODS This was a retrospective analysis of the DURABLE trial, a randomized, open-label, two-arm, parallel study. The trial enrolled 2091 insulin-naïve T2DM patients aged 30 to 80 years. Patients received twice-daily insulin lispro mix 75/25 (LM75/25) or once-daily insulin glargine (insulin glargine). The SIDES methodology was used to find subgroups where the treatment effect was substantially different from what was observed in the full population of the clinical trial. A subgroup identification tool implementing the SIDES algorithm was used to examine data for 1092 patients (584 LM75/25 and 508 insulin glargine), achieving at-goal 12- or 24-week glycated hemoglobin A1c (A1C) (≤7.0%). RESULTS The overall at-goal population treatment difference (A1C reduction) was not clinically meaningful, but a clinically meaningful difference was observed (A1C reduction 2.31% ± 0.06% LM75/25 versus 2.01% ± 0.07% insulin glargine; p = .001) in patients with a baseline fasting insulin level >11.4 μIU/ml and age ≤56 years. CONCLUSION The observation that younger patients with higher levels of fasting insulin may benefit from a regimen that includes short-acting insulin targeting postprandial glycemia helps explain the heterogeneity in response and warrants further investigation.
Collapse
Affiliation(s)
- Dana S Hardin
- Eli Lilly and Company, Lilly Corporate Center, Indianapolis, IN 46285, USA.
| | | | | | | | | | | | | | | |
Collapse
|