1
|
Argyriou E, Gros D, Hernandez Tejada MA, Muzzy WA, Acierno R. A machine learning personalized treatment rule to optimize assignment to psychotherapies for grief among veterans. J Affect Disord 2024; 358:466-473. [PMID: 38718947 DOI: 10.1016/j.jad.2024.05.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 04/03/2024] [Accepted: 05/02/2024] [Indexed: 05/19/2024]
Abstract
BACKGROUND Complex grief patterns are associated with significant suffering, functional impairments, health and mental health problems, and increased healthcare use. This burden may be even more pronounced among veterans. Behavioral Activation and Therapeutic Exposure (BATE-G) and Cognitive Therapy for Grief (CT-G) are two evidence-based interventions for grief. The goal of this study was to use a precision medicine approach to develop a personalized treatment rule to optimize assignment among these psychotherapies. METHODS We analyzed data (N = 155) from a randomized clinical trial comparing BATE-G and CT-G. Outcome weighted learning was used to estimate an optimal personalized treatment rule. Baseline characteristics including demographics, social support, variables related to the death, and psychopathology dimensions were used as prescriptive factors of treatment assignment. RESULTS The estimated rule assigned 72 veterans to CT-G and 56 to BATE-G. Assigning participants according to this rule was estimated to lead to markedly lower mean grief level following 6 months from treatment compared to assigning everyone to either BATE-G (Vdopt - VBATE-G = -18.57 [95 % CI: -29.41, -7.72]) or CT-G (Vdopt - VBATE-G = -20.89 [95 % CI: -30.7, -11.07]) regardless of their characteristics. LIMITATIONS Participants were primarily male veterans, and identified with Black or White race. The estimated rule was not externally validated. CONCLUSION The estimated rule used relatively simple, easily accessible, client characteristics to personalize assignment to treatment using a precision medicine approach based on machine learning and causal inference. Upon further validation, such a rule can be easily implemented in clinical practice to prescriptively maximize treatment benefits.
Collapse
Affiliation(s)
- Evangelia Argyriou
- Department of Psychiatry & Behavioral Sciences, Medical University of South Carolina, United States; Department of Psychology, Indiana University Indianapolis, United States
| | - Daniel Gros
- Department of Psychiatry & Behavioral Sciences, Medical University of South Carolina, United States; Mental Health Service, Ralph H. Johnson VA Healthcare System, United States.
| | - Melba A Hernandez Tejada
- Faillace Department of Psychiatry, University of Texas Health Science Center at Houston, United States
| | - Wendy A Muzzy
- Department of Psychiatry & Behavioral Sciences, Medical University of South Carolina, United States; Mental Health Service, Ralph H. Johnson VA Healthcare System, United States
| | - Ronald Acierno
- Faillace Department of Psychiatry, University of Texas Health Science Center at Houston, United States
| |
Collapse
|
2
|
Zhang Y, Kreif N, Gc VS, Manca A. Machine Learning Methods to Estimate Individualized Treatment Effects for Use in Health Technology Assessment. Med Decis Making 2024:272989X241263356. [PMID: 39056320 DOI: 10.1177/0272989x241263356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/28/2024]
Abstract
BACKGROUND Recent developments in causal inference and machine learning (ML) allow for the estimation of individualized treatment effects (ITEs), which reveal whether treatment effectiveness varies according to patients' observed covariates. ITEs can be used to stratify health policy decisions according to individual characteristics and potentially achieve greater population health. Little is known about the appropriateness of available ML methods for use in health technology assessment. METHODS In this scoping review, we evaluate ML methods available for estimating ITEs, aiming to help practitioners assess their suitability in health technology assessment. We present a taxonomy of ML approaches, categorized by key challenges in health technology assessment using observational data, including handling time-varying confounding and time-to event data and quantifying uncertainty. RESULTS We found a wide range of algorithms for simpler settings with baseline confounding and continuous or binary outcomes. Not many ML algorithms can handle time-varying or unobserved confounding, and at the time of writing, no ML algorithm was capable of estimating ITEs for time-to-event outcomes while accounting for time-varying confounding. Many of the ML algorithms that estimate ITEs in longitudinal settings do not formally quantify uncertainty around the point estimates. LIMITATIONS This scoping review may not cover all relevant ML methods and algorithms as they are continuously evolving. CONCLUSIONS Existing ML methods available for ITE estimation are limited in handling important challenges posed by observational data when used for cost-effectiveness analysis, such as time-to-event outcomes, time-varying and hidden confounding, or the need to estimate sampling uncertainty around the estimates. IMPLICATIONS ML methods are promising but need further development before they can be used to estimate ITEs for health technology assessments. HIGHLIGHTS Estimating individualized treatment effects (ITEs) using observational data and machine learning (ML) can support personalized treatment advice and help deliver more customized information on the effectiveness and cost-effectiveness of health technologies.ML methods for ITE estimation are mostly designed for handling confounding at baseline but not time-varying or unobserved confounding. The few models that account for time-varying confounding are designed for continuous or binary outcomes, not time-to-event outcomes.Not all ML methods for estimating ITEs can quantify the uncertainty of their predictions.Future work on developing ML that addresses the concerns summarized in this review is needed before these methods can be widely used in clinical and health technology assessment-like decision making.
Collapse
Affiliation(s)
| | - Noemi Kreif
- Centre for Health Economics, University of York, UK
- Department of Pharmacy, University of Washington, Seattle, USA
| | - Vijay S Gc
- School of Human and Health Sciences, University of Huddersfield, UK
| | - Andrea Manca
- Centre for Health Economics, University of York, UK
| |
Collapse
|
3
|
Lipkovich I, Svensson D, Ratitch B, Dmitrienko A. Modern approaches for evaluating treatment effect heterogeneity from clinical trials and observational data. Stat Med 2024. [PMID: 39054669 DOI: 10.1002/sim.10167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 05/28/2024] [Accepted: 06/21/2024] [Indexed: 07/27/2024]
Abstract
In this paper, we review recent advances in statistical methods for the evaluation of the heterogeneity of treatment effects (HTE), including subgroup identification and estimation of individualized treatment regimens, from randomized clinical trials and observational studies. We identify several types of approaches using the features introduced in Lipkovich et al (Stat Med 2017;36: 136-196) that distinguish the recommended principled methods from basic methods for HTE evaluation that typically rely on rules of thumb and general guidelines (the methods are often referred to as common practices). We discuss the advantages and disadvantages of various principled methods as well as common measures for evaluating their performance. We use simulated data and a case study based on a historical clinical trial to illustrate several new approaches to HTE evaluation.
Collapse
Affiliation(s)
- Ilya Lipkovich
- Advanced Analytics and Access Capabilities, Eli Lilly and Company, Indianapolis, Indiana, USA
| | - David Svensson
- Statistical Innovation, BioPharmaceuticals R&D, AstraZeneca, Gothenburg, Sweden
| | - Bohdana Ratitch
- Clinical Statistics and Analytics, Research & Development, Pharmaceuticals, Bayer Inc., Mississauga, Ontario, Canada
| | - Alex Dmitrienko
- Department of Biostatistics, Mediana, San Juan, Puerto Rico, USA
| |
Collapse
|
4
|
He Q, Zhang S, LeBlanc ML, Zhao YQ. Estimating individualized treatment rules by optimizing the adjusted probability of a longer survival. Stat Methods Med Res 2024:9622802241262525. [PMID: 39053567 DOI: 10.1177/09622802241262525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2024]
Abstract
Individualized treatment rules inform tailored treatment decisions based on the patient's information, where the goal is to optimize clinical benefit for the population. When the clinical outcome of interest is survival time, most of current approaches typically aim to maximize the expected time of survival. We propose a new criterion for constructing Individualized treatment rules that optimize the clinical benefit with survival outcomes, termed as the adjusted probability of a longer survival. This objective captures the likelihood of living longer with being on treatment, compared to the alternative, which provides an alternative and often straightforward interpretation to communicate with clinicians and patients. We view it as an alternative to the survival analysis standard of the hazard ratio and the increasingly used restricted mean survival time. We develop a new method to construct the optimal Individualized treatment rule by maximizing a nonparametric estimator of the adjusted probability of a longer survival for a decision rule. Simulation studies demonstrate the reliability of the proposed method across a range of different scenarios. We further perform data analysis using data collected from a randomized Phase III clinical trial (SWOG S0819).
Collapse
Affiliation(s)
- Qijia He
- Department of Statistics, University of Washington, Seattle, WA, USA
| | | | - Michael L LeBlanc
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Ying-Qi Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| |
Collapse
|
5
|
Hartman H, Schipper M, Kidwell K. A sequential, multiple assignment, randomized trial design with a tailoring function. Stat Med 2024. [PMID: 38973591 DOI: 10.1002/sim.10161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 05/30/2024] [Accepted: 06/17/2024] [Indexed: 07/09/2024]
Abstract
We present a trial design for sequential multiple assignment randomized trials (SMARTs) that use a tailoring function instead of a binary tailoring variable allowing for simultaneous development of the tailoring variable and estimation of dynamic treatment regimens (DTRs). We apply methods for developing DTRs from observational data: tree-based regression learning and Q-learning. We compare this to a balanced randomized SMART with equal re-randomization probabilities and a typical SMART design where re-randomization depends on a binary tailoring variable and DTRs are analyzed with weighted and replicated regression. This project addresses a gap in clinical trial methodology by presenting SMARTs where second stage treatment is based on a continuous outcome removing the need for a binary tailoring variable. We demonstrate that data from a SMART using a tailoring function can be used to efficiently estimate DTRs and is more flexible under varying scenarios than a SMART using a tailoring variable.
Collapse
Affiliation(s)
- Holly Hartman
- Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Matthew Schipper
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| | - Kelley Kidwell
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
6
|
Yang B, Guo X, Loh JM, Wang Q, Wang Y. Learning optimal biomarker-guided treatment policy for chronic disorders. Stat Med 2024; 43:2765-2782. [PMID: 38700103 PMCID: PMC11178467 DOI: 10.1002/sim.10099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 03/17/2024] [Accepted: 04/20/2024] [Indexed: 05/05/2024]
Abstract
Electroencephalogram (EEG) provides noninvasive measures of brain activity and is found to be valuable for the diagnosis of some chronic disorders. Specifically, pre-treatment EEG signals in the alpha and theta frequency bands have demonstrated some association with antidepressant response, which is well-known to have a low response rate. We aim to design an integrated pipeline that improves the response rate of patients with major depressive disorder by developing a treatment policy guided by the resting state pre-treatment EEG recordings and other treatment effects modifiers. First, we design an innovative automatic site-specific EEG preprocessing pipeline to extract features with stronger signals than raw data. We then estimate the conditional average treatment effect (CATE) using causal forests and use a doubly robust technique to improve efficiency in the estimation of the average treatment effect. We present evidence of heterogeneity in the treatment effect and the modifying power of the EEG features, as well as a significant average treatment effect, a result that cannot be obtained with conventional methods. Finally, we employ an efficient policy learning algorithm to learn an optimal depth-2 treatment assignment decision tree and compare its performance with Q-Learning and outcome-weighted learning via simulation studies and an application to a large multi-site, double-blind, randomized controlled clinical trial, EMBARC.
Collapse
Affiliation(s)
- Bin Yang
- Department of Biostatistics, Columbia University, New York, New York, USA
| | - Xingche Guo
- Department of Biostatistics, Columbia University, New York, New York, USA
| | - Ji Meng Loh
- Department of Mathematical Sciences, New Jersey Institute of Technology, Newark, New Jersey, USA
| | - Qinxia Wang
- Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | - Yuanjia Wang
- Department of Biostatistics, Columbia University, New York, New York, USA
- Department of Psychiatry, Columbia University, New York, New York, USA
| |
Collapse
|
7
|
Bouvier F, Peyrot E, Balendran A, Ségalas C, Roberts I, Petit F, Porcher R. Do machine learning methods lead to similar individualized treatment rules? A comparison study on real data. Stat Med 2024; 43:2043-2061. [PMID: 38472745 DOI: 10.1002/sim.10059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 01/30/2024] [Accepted: 03/01/2024] [Indexed: 03/14/2024]
Abstract
Identifying patients who benefit from a treatment is a key aspect of personalized medicine, which allows the development of individualized treatment rules (ITRs). Many machine learning methods have been proposed to create such rules. However, to what extent the methods lead to similar ITRs, that is, recommending the same treatment for the same individuals is unclear. In this work, we compared 22 of the most common approaches in two randomized control trials. Two classes of methods can be distinguished. The first class of methods relies on predicting individualized treatment effects from which an ITR is derived by recommending the treatment evaluated to the individuals with a predicted benefit. In the second class, methods directly estimate the ITR without estimating individualized treatment effects. For each trial, the performance of ITRs was assessed by various metrics, and the pairwise agreement between all ITRs was also calculated. Results showed that the ITRs obtained via the different methods generally had considerable disagreements regarding the patients to be treated. A better concordance was found among akin methods. Overall, when evaluating the performance of ITRs in a validation sample, all methods produced ITRs with limited performance, suggesting a high potential for optimism. For non-parametric methods, this optimism was likely due to overfitting. The different methods do not lead to similar ITRs and are therefore not interchangeable. The choice of the method strongly influences for which patients a certain treatment is recommended, drawing some concerns about their practical use.
Collapse
Affiliation(s)
- Florie Bouvier
- Inserm, INRAE, Center for Research in Epidemiology and StatisticS (CRESS), Université Paris Cité and Université Sorbonne Paris Nord, Paris, France
| | - Etienne Peyrot
- Inserm, INRAE, Center for Research in Epidemiology and StatisticS (CRESS), Université Paris Cité and Université Sorbonne Paris Nord, Paris, France
| | - Alan Balendran
- Inserm, INRAE, Center for Research in Epidemiology and StatisticS (CRESS), Université Paris Cité and Université Sorbonne Paris Nord, Paris, France
| | - Corentin Ségalas
- Bordeaux Population Health Research Center, Université de Bordeaux, Inserm, Bordeaux, France
| | - Ian Roberts
- Clinical Trials Unit, London School of Hygiene & Tropical Medicine, London, UK
| | - François Petit
- Inserm, INRAE, Center for Research in Epidemiology and StatisticS (CRESS), Université Paris Cité and Université Sorbonne Paris Nord, Paris, France
| | - Raphaël Porcher
- Inserm, INRAE, Center for Research in Epidemiology and StatisticS (CRESS), Université Paris Cité and Université Sorbonne Paris Nord, Paris, France
- Centre d'Épidémiologie Clinique, Assistance Publique-Hôpitaux de Paris, Hôtel-Dieu, Paris, France
| |
Collapse
|
8
|
Wang X, Lee H, Haaland B, Kerrigan K, Puri S, Akerley W, Shen J. A matching-based machine learning approach to estimating optimal dynamic treatment regimes with time-to-event outcomes. Stat Methods Med Res 2024; 33:794-806. [PMID: 38502008 DOI: 10.1177/09622802241236954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
Observational data (e.g. electronic health records) has become increasingly important in evidence-based research on dynamic treatment regimes, which tailor treatments over time to patients based on their characteristics and evolving clinical history. It is of great interest for clinicians and statisticians to identify an optimal dynamic treatment regime that can produce the best expected clinical outcome for each individual and thus maximize the treatment benefit over the population. Observational data impose various challenges for using statistical tools to estimate optimal dynamic treatment regimes. Notably, the task becomes more sophisticated when the clinical outcome of primary interest is time-to-event. Here, we propose a matching-based machine learning method to identify the optimal dynamic treatment regime with time-to-event outcomes subject to right-censoring using electronic health record data. In contrast to the established inverse probability weighting-based dynamic treatment regime methods, our proposed approach provides better protection against model misspecification and extreme weights in the context of treatment sequences, effectively addressing a prevalent challenge in the longitudinal analysis of electronic health record data. In simulations, the proposed method demonstrates robust performance across a range of scenarios. In addition, we illustrate the method with an application to estimate optimal dynamic treatment regimes for patients with advanced non-small cell lung cancer using a real-world, nationwide electronic health record database from Flatiron Health.
Collapse
Affiliation(s)
- Xuechen Wang
- Department of Population Health Sciences, Division of Biostatistics, University of Utah, Salt Lake City, UT, USA
| | - Hyejung Lee
- Department of Population Health Sciences, Division of Biostatistics, University of Utah, Salt Lake City, UT, USA
| | - Benjamin Haaland
- Department of Population Health Sciences, Division of Biostatistics, University of Utah, Salt Lake City, UT, USA
| | - Kathleen Kerrigan
- Department of Internal Medicine, Division of Oncology, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Sonam Puri
- Department of Internal Medicine, Division of Oncology, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Wallace Akerley
- Department of Internal Medicine, Division of Oncology, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Jincheng Shen
- Department of Population Health Sciences, Division of Biostatistics, University of Utah, Salt Lake City, UT, USA
| |
Collapse
|
9
|
Lofaro D, Amparore D, Perri A, Rago V, Piana A, Zaccone V, Morelli M, Bisegna C, Suraci PP, Conforti D, Porpiglia F, Di Dio M. Comparing Perioperative Complications of Off-Clamp versus On-Clamp Partial Nephrectomy for Renal Cancer Using a Novel Energy Balancing Weights Method. Life (Basel) 2024; 14:442. [PMID: 38672713 PMCID: PMC11050879 DOI: 10.3390/life14040442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 03/24/2024] [Accepted: 03/25/2024] [Indexed: 04/28/2024] Open
Abstract
Partial nephrectomy (PN) is the primary surgical method for renal tumor treatment, typically involving clamping the renal artery during tumor removal, leading to warm ischemia and potential renal function impairment. Off-clamp approaches have been explored to mitigate organ damage, yet few results have emerged about the possible effects on hemoglobin loss. Most evidence comes from retrospective studies using propensity score matching, known to be sensitive to PS model misspecification. The energy balancing weights (EBW) method offers an alternative method to address bias by focusing on balancing all the characteristics of covariate distribution. We aimed to compare on- vs. off-clamp techniques in PN using EB-weighted retrospective patient data. Out of 333 consecutive PNs (275/58 on/off-clamp ratio), the EBW method achieved balanced variables, notably tumor anatomy and staging. No significant differences were observed in the operative endpoints between on- and off-clamp techniques, although off-clamp PNs showed slight reductions in hemoglobin loss and renal function decline, albeit with slightly higher perioperative blood loss. Our findings support previous evidence, indicating comparable surgical outcomes between standard and off-clamp procedures, with the EBW method proving effective in balancing baseline variables in observational studies comparing interventions.
Collapse
Affiliation(s)
- Danilo Lofaro
- Department of Mathematics and Computer Science, University of Calabria, 87036 Rende, Italy;
| | - Daniele Amparore
- Division of Urology, Department of Oncology, School of Medicine, San Luigi Hospital, University of Turin, 10043 Orbassano, Italy; (D.A.); (A.P.); (F.P.)
| | - Anna Perri
- Department of Experimental and Clinical Medicine, Magna Graecia University, 88100 Catanzaro, Italy;
| | - Vittoria Rago
- Department of Pharmacy, Health and Nutritional Sciences, University of Calabria, 87036 Rende, Italy
| | - Alberto Piana
- Division of Urology, Department of Oncology, School of Medicine, San Luigi Hospital, University of Turin, 10043 Orbassano, Italy; (D.A.); (A.P.); (F.P.)
| | - Vincenzo Zaccone
- Division of Urology, Department of Surgery, Annunziata Hospital, 87100 Cosenza, Italy; (V.Z.); (M.D.D.)
| | - Michele Morelli
- Department of Obstetrics and Gynecology, Annunziata Hospital, 87100 Cosenza, Italy;
| | - Claudio Bisegna
- Unit of Urological Minimally Invasive Robotic Surgery and Renal Transplantation, Department of Experimental and Clinical Medicine, Careggi Hospital, University of Florence, 50134 Florence, Italy;
| | - Paolo Pietro Suraci
- Urology Unit, Department of Medical-Surgical Sciences and Biotechnologies, Faculty of Pharmacy and Medicine, Sapienza University of Rome, 04100 Latina, Italy;
| | - Domenico Conforti
- de-Health Lab, Department of Mechanical, Energetic and Management Engineering, University of Calabria, 87036 Rende, Italy;
| | - Francesco Porpiglia
- Division of Urology, Department of Oncology, School of Medicine, San Luigi Hospital, University of Turin, 10043 Orbassano, Italy; (D.A.); (A.P.); (F.P.)
| | - Michele Di Dio
- Division of Urology, Department of Surgery, Annunziata Hospital, 87100 Cosenza, Italy; (V.Z.); (M.D.D.)
| |
Collapse
|
10
|
Butzin-Dozier Z, Ji Y, Coyle J, Malenica I, McQuade ETR, Grembi JA, Platts-Mills JA, Houpt ER, Graham JP, Ali S, Rahman MZ, Alauddin M, Famida SL, Akther S, Hossen MS, Mutsuddi P, Shoab AK, Rahman M, Islam MO, Miah R, Taniuchi M, Liu J, Alauddin S, Stewart CP, Luby SP, Colford JM, Hubbard AE, Mertens AN, Lin A. Treatment Heterogeneity of Water, Sanitation, Hygiene, and Nutrition Interventions on Child Growth by Environmental Enteric Dysfunction and Pathogen Status for Young Children in Bangladesh. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.03.21.24304684. [PMID: 38585931 PMCID: PMC10996736 DOI: 10.1101/2024.03.21.24304684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Background Water, sanitation, hygiene (WSH), nutrition (N), and combined (N+WSH) interventions are often implemented by global health organizations, but WSH interventions may insufficiently reduce pathogen exposure, and nutrition interventions may be modified by environmental enteric dysfunction (EED), a condition of increased intestinal permeability and inflammation. This study investigated the heterogeneity of these treatments' effects based on individual pathogen and EED biomarker status with respect to child linear growth. Methods We applied cross-validated targeted maximum likelihood estimation and super learner ensemble machine learning to assess the conditional treatment effects in subgroups defined by biomarker and pathogen status. We analyzed treatment (N+WSH, WSH, N, or control) randomly assigned in-utero, child pathogen and EED data at 14 months of age, and child LAZ at 28 months of age. We estimated the difference in mean child length for age Z-score (LAZ) under the treatment rule and the difference in stratified treatment effect (treatment effect difference) comparing children with high versus low pathogen/biomarker status while controlling for baseline covariates. Results We analyzed data from 1,522 children, who had median LAZ of -1.56. We found that myeloperoxidase (N+WSH treatment effect difference 0.0007 LAZ, WSH treatment effect difference 0.1032 LAZ, N treatment effect difference 0.0037 LAZ) and Campylobacter infection (N+WSH treatment effect difference 0.0011 LAZ, WSH difference 0.0119 LAZ, N difference 0.0255 LAZ) were associated with greater effect of all interventions on growth. In other words, children with high myeloperoxidase or Campylobacter infection experienced a greater impact of the interventions on growth. We found that a treatment rule that assigned the N+WSH (LAZ difference 0.23, 95% CI (0.05, 0.41)) and WSH (LAZ difference 0.17, 95% CI (0.04, 0.30)) interventions based on EED biomarkers and pathogens increased predicted child growth compared to the randomly allocated intervention. Conclusions These findings indicate that EED biomarker and pathogen status, particularly Campylobacter and myeloperoxidase (a measure of gut inflammation), may be related to impact of N+WSH, WSH, and N interventions on child linear growth.
Collapse
Affiliation(s)
| | - Yunwen Ji
- School of Public Health, University of California, Berkeley, Berkeley, CA USA
| | - Jeremy Coyle
- School of Public Health, University of California, Berkeley, Berkeley, CA USA
| | - Ivana Malenica
- School of Public Health, University of California, Berkeley, Berkeley, CA USA
| | | | - Jessica Anne Grembi
- Division of Infectious Diseases and Geographic Medicine, Stanford University, Stanford, CA USA
| | | | - Eric R. Houpt
- School of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Jay P. Graham
- School of Public Health, University of California, Berkeley, Berkeley, CA USA
| | - Shahjahan Ali
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Md Ziaur Rahman
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Mohammad Alauddin
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Syeda L. Famida
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Salma Akther
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Md. Saheen Hossen
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Palash Mutsuddi
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Abul K. Shoab
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Mahbubur Rahman
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Md. Ohedul Islam
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Rana Miah
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Mami Taniuchi
- School of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Jie Liu
- School of Public Health, Qingdao University, Qingdao, China
| | | | | | - Stephen P. Luby
- Division of Infectious Diseases and Geographic Medicine, Stanford University, Stanford, CA USA
| | - John M. Colford
- School of Public Health, University of California, Berkeley, Berkeley, CA USA
| | - Alan E. Hubbard
- School of Public Health, University of California, Berkeley, Berkeley, CA USA
| | - Andrew N. Mertens
- School of Public Health, University of California, Berkeley, Berkeley, CA USA
| | - Audrie Lin
- Department of Microbiology and Environmental Toxicology, University of California, Santa Cruz, Santa Cruz, CA USA
| |
Collapse
|
11
|
Xie S, Ogden RT. Functional support vector machine. Biostatistics 2024:kxae007. [PMID: 38476094 DOI: 10.1093/biostatistics/kxae007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 12/26/2023] [Accepted: 02/13/2024] [Indexed: 03/14/2024] Open
Abstract
Linear and generalized linear scalar-on-function modeling have been commonly used to understand the relationship between a scalar response variable (e.g. continuous, binary outcomes) and functional predictors. Such techniques are sensitive to model misspecification when the relationship between the response variable and the functional predictors is complex. On the other hand, support vector machines (SVMs) are among the most robust prediction models but do not take account of the high correlations between repeated measurements and cannot be used for irregular data. In this work, we propose a novel method to integrate functional principal component analysis with SVM techniques for classification and regression to account for the continuous nature of functional data and the nonlinear relationship between the scalar response variable and the functional predictors. We demonstrate the performance of our method through extensive simulation experiments and two real data applications: the classification of alcoholics using electroencephalography signals and the prediction of glucobrassicin concentration using near-infrared reflectance spectroscopy. Our methods especially have more advantages when the measurement errors in functional predictors are relatively large.
Collapse
Affiliation(s)
- Shanghong Xie
- School of Statistics, Southwestern University of Finance and Economics, Chengdu, China
- Department of Biostatistics, Columbia University, New York, NY, United States
| | - R Todd Ogden
- Department of Biostatistics, Columbia University, New York, NY, United States
| |
Collapse
|
12
|
Zainal NH, Newman MG. Which client with generalized anxiety disorder benefits from a mindfulness ecological momentary intervention versus a self-monitoring app? Developing a multivariable machine learning predictive model. J Anxiety Disord 2024; 102:102825. [PMID: 38245961 PMCID: PMC10922999 DOI: 10.1016/j.janxdis.2024.102825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 12/26/2023] [Accepted: 01/02/2024] [Indexed: 01/23/2024]
Abstract
Precision medicine methods (machine learning; ML) can identify which clients with generalized anxiety disorder (GAD) benefit from mindfulness ecological momentary intervention (MEMI) vs. self-monitoring app (SM). We used randomized controlled trial data of MEMI vs. SM for GAD (N = 110) and tested three ML models to predict one-month follow-up reliable improvement in GAD severity, perseverative cognitions (PC), trait mindfulness (TM), and executive function (EF). Eleven baseline predictors were tested regarding differential reliable change from MEMI vs. SM (age, sex, race, EF errors, inhibitory dyscontrol, set-shifting deficits, verbal fluency, working memory, GAD severity, TM, PC). The final top five prescriptive predictor models of all outcomes performed well (AUC = .752 .886). The following variables predicted better outcome from MEMI vs. SM: Higher GAD severity predicted more GAD improvement but less EF improvement. Elevated PC, inhibitory dyscontrol, and verbal dysfluency predicted better improvement in most outcomes. Greater set-shifting and TM predicted stronger improvements in GAD symptoms and TM. Older age predicted more alleviation of GAD and PC symptoms. Women exhibited more enhancements in trait mindfulness and EF than men. White individuals benefitted more than non-White. PC, TM, EF, and sociodemographic data might help predictive models optimize intervention selection for GAD.
Collapse
Affiliation(s)
- Nur Hani Zainal
- Harvard Medical School, Boston, MA, USA; National University of Singapore, Kent Ridge, Singapore.
| | | |
Collapse
|
13
|
Cho H, She J, De Marchi D, El-Zaatari H, Barnes EL, Kahkoska AR, Kosorok MR, Virkud AV. Machine Learning and Health Science Research: Tutorial. J Med Internet Res 2024; 26:e50890. [PMID: 38289657 PMCID: PMC10865203 DOI: 10.2196/50890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 11/30/2023] [Accepted: 12/21/2023] [Indexed: 02/01/2024] Open
Abstract
Machine learning (ML) has seen impressive growth in health science research due to its capacity for handling complex data to perform a range of tasks, including unsupervised learning, supervised learning, and reinforcement learning. To aid health science researchers in understanding the strengths and limitations of ML and to facilitate its integration into their studies, we present here a guideline for integrating ML into an analysis through a structured framework, covering steps from framing a research question to study design and analysis techniques for specialized data types.
Collapse
Affiliation(s)
- Hunyong Cho
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Jane She
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Daniel De Marchi
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Helal El-Zaatari
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Edward L Barnes
- Division of Gastroenterology and Hepatology, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Center for Gastrointestinal Biology and Diseases, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Anna R Kahkoska
- Department of Nutrition, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Division of Endocrinology and Metabolism, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Center for Aging and Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Arti V Virkud
- Kidney Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| |
Collapse
|
14
|
Freeman NLB, Browder SE, McGinigle KL, Kosorok MR. Individualized treatment rule characterization via a value function surrogate. Biometrics 2024; 80:ujad012. [PMID: 38372403 PMCID: PMC10875523 DOI: 10.1093/biomtc/ujad012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 10/19/2023] [Accepted: 11/14/2023] [Indexed: 02/20/2024]
Abstract
Precision medicine is a promising framework for generating evidence to improve health and health care. Yet, a gap persists between the ever-growing number of statistical precision medicine strategies for evidence generation and implementation in real-world clinical settings, and the strategies for closing this gap will likely be context-dependent. In this paper, we consider the specific context of partial compliance to wound management among patients with peripheral artery disease. Using a Gaussian process surrogate for the value function, we show the feasibility of using Bayesian optimization to learn optimal individualized treatment rules. Further, we expand beyond the common precision medicine task of learning an optimal individualized treatment rule to the characterization of classes of individualized treatment rules and show how those findings can be translated into clinical contexts.
Collapse
Affiliation(s)
- Nikki L B Freeman
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
| | - Sydney E Browder
- Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
| | - Katharine L McGinigle
- Department of Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
| |
Collapse
|
15
|
Pedone M, Argiento R, Stingo FC. Personalized treatment selection via product partition models with covariates. Biometrics 2024; 80:ujad003. [PMID: 38364806 DOI: 10.1093/biomtc/ujad003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 07/27/2023] [Accepted: 11/03/2023] [Indexed: 02/18/2024]
Abstract
Precision medicine is an approach for disease treatment that defines treatment strategies based on the individual characteristics of the patients. Motivated by an open problem in cancer genomics, we develop a novel model that flexibly clusters patients with similar predictive characteristics and similar treatment responses; this approach identifies, via predictive inference, which one among a set of treatments is better suited for a new patient. The proposed method is fully model based, avoiding uncertainty underestimation attained when treatment assignment is performed by adopting heuristic clustering procedures, and belongs to the class of product partition models with covariates, here extended to include the cohesion induced by the normalized generalized gamma process. The method performs particularly well in scenarios characterized by considerable heterogeneity of the predictive covariates in simulation studies. A cancer genomics case study illustrates the potential benefits in terms of treatment response yielded by the proposed approach. Finally, being model based, the approach allows estimating clusters' specific response probabilities and then identifying patients more likely to benefit from personalized treatment.
Collapse
Affiliation(s)
- Matteo Pedone
- Department of Statistics, Computer Science and Applications, University of Florence, Florence, Italy, 50134
| | - Raffaele Argiento
- Department of Economics, University of Bergamo, Bergamo, Italy, 24121
| | - Francesco C Stingo
- Department of Statistics, Computer Science and Applications, University of Florence, Florence, Italy, 50134
| |
Collapse
|
16
|
Wang J, Doan LV, Axelrod D, Rotrosen J, Wang B, Park HG, Edwards RR, Curatolo M, Jackman C, Perez R. Optimizing the use of ketamine to reduce chronic postsurgical pain in women undergoing mastectomy for oncologic indication: study protocol for the KALPAS multicenter randomized controlled trial. Trials 2024; 25:67. [PMID: 38243266 PMCID: PMC10797799 DOI: 10.1186/s13063-023-07884-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 12/15/2023] [Indexed: 01/21/2024] Open
Abstract
BACKGROUND Mastectomies are commonly performed and strongly associated with chronic postsurgical pain (CPSP), more specifically termed postmastectomy pain syndrome (PMPS), with 25-60% of patients reporting pain 3 months after surgery. PMPS interferes with function, recovery, and compliance with adjuvant therapy. Importantly, it is associated with chronic opioid use, as a recent study showed that 1 in 10 patients continue to use opioids at least 3 months after curative surgery. The majority of PMPS patients are women, and, over the past 10 years, women have outpaced men in the rate of growth in opioid dependence. Standard perioperative multimodal analgesia is only modestly effective in prevention of CPSP. Thus, interventions to reduce CPSP and PMPS are urgently needed. Ketamine is well known to improve pain and reduce opioid use in the acute postoperative period. Additionally, ketamine has been shown to control mood in studies of anxiety and depression. By targeting acute pain and improving mood in the perioperative period, ketamine may be able to prevent the development of CPSP. METHODS Ketamine analgesia for long-lasting pain relief after surgery (KALPAS) is a phase 3, multicenter, randomized, placebo-controlled, double-blind trial to study the effectiveness of ketamine in reducing PMPS. The study compares continuous perioperative ketamine infusion vs single-dose ketamine in the postanesthesia care unit vs placebo for reducing PMPS. Participants are followed for 1 year after surgery. The primary outcome is pain at the surgical site at 3 months after the index surgery as assessed with the Brief Pain Inventory-short form pain severity subscale. DISCUSSION This project is part of the NIH Helping to End Addiction Long-term (HEAL) Initiative, a nationwide effort to address the opioid public health crisis. This study can substantially impact perioperative pain management and can contribute significantly to combatting the opioid epidemic. TRIAL REGISTRATION ClinicalTrials.gov NCT05037123. Registered on September 8, 2021.
Collapse
Affiliation(s)
- Jing Wang
- Department of Anesthesiology, Perioperative Care, and Pain Medicine, NYU Grossman School of Medicine, New York, NY, USA.
- Department of Neuroscience and Physiology, NYU Grossman School of Medicine, New York, NY, USA.
| | - Lisa V Doan
- Department of Anesthesiology, Perioperative Care, and Pain Medicine, NYU Grossman School of Medicine, New York, NY, USA.
| | - Deborah Axelrod
- Department of Surgery, NYU Grossman School of Medicine, New York, NY, USA
| | - John Rotrosen
- Department of Psychiatry, NYU Grossman School of Medicine, New York, NY, USA
| | - Binhuan Wang
- Department of Population Health, NYU Grossman School of Medicine, New York, NY, USA
| | - Hyung G Park
- Department of Population Health, NYU Grossman School of Medicine, New York, NY, USA
| | - Robert R Edwards
- Department of Anesthesia, Brigham and Women's Hospital, Boston, MA, USA
| | - Michele Curatolo
- Department of Anesthesiology and Pain Medicine, University of Washington, Seattle, WA, USA
| | - Carina Jackman
- Department of Anesthesiology, University of Utah, Salt Lake City, UT, USA
| | - Raven Perez
- Department of Anesthesiology, Perioperative Care, and Pain Medicine, NYU Grossman School of Medicine, New York, NY, USA
| |
Collapse
|
17
|
Spicker D, Moodie EE, Shortreed SM. Differentially Private Outcome-Weighted Learning for Optimal Dynamic Treatment Regime Estimation. Stat (Int Stat Inst) 2024; 13:e641. [PMID: 39070170 PMCID: PMC11281278 DOI: 10.1002/sta4.641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 11/12/2023] [Indexed: 07/30/2024]
Abstract
Precision medicine is a framework for developing evidence-based medical recommendations that seeks to determine the optimal sequence of treatments tailored to all of the relevant patient-level characteristics which are observable. Because precision medicine relies on highly sensitive, patient-level data, ensuring the privacy of participants is of great importance. Dynamic treatment regimes (DTRs) provide one formalization of precision medicine in a longitudinal setting. Outcome-Weighted Learning (OWL) is a family of techniques for estimating optimal DTRs based on observational data. OWL techniques leverage support vector machine (SVM) classifiers in order to perform estimation. SVMs perform classification based on a set of influential points in the data known as support vectors. The classification rule produced by SVMs often requires direct access to the support vectors. Thus, releasing a treatment policy estimated with OWL requires the release of patient data for a subset of patients in the sample. As a result, the classification rules from SVMs constitute a severe privacy violation for those individuals whose data comprise the support vectors. This privacy violation is a major concern, particularly in light of the potentially highly sensitive medical data which are used in DTR estimation. Differential privacy has emerged as a mathematical framework for ensuring the privacy of individual-level data, with provable guarantees on the likelihood that individual characteristics can be determined by an adversary. We provide the first investigation of differential privacy in the context of DTRs and provide a differentially private OWL estimator, with theoretical results allowing us to quantify the cost of privacy in terms of the accuracy of the private estimators.
Collapse
Affiliation(s)
- Dylan Spicker
- Department of Mathematics and Statistics, University of New Brunswick (Saint John), NB, Canada
| | - Erica E.M. Moodie
- Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, QC, Canada
| | - Susan M. Shortreed
- Kaiser Permanente Washington Health Research Institute, WA, USA
- Department of Biostatistics University of Washington, WA, USA
| |
Collapse
|
18
|
Wu D, Goldfeld KS, Petkova E, Park HG. Improving Individualized Treatment Decisions: A Bayesian Multivariate Hierarchical Model for Developing a Treatment Benefit Index using Mixed Types of Outcomes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2023.11.17.23298711. [PMID: 38014277 PMCID: PMC10680905 DOI: 10.1101/2023.11.17.23298711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Background Precision medicine has led to the development of targeted treatment strategies tailored to individual patients based on their characteristics and disease manifestations. Although precision medicine often focuses on a single health outcome for individualized treatment decision rules (ITRs), relying only on a single outcome rather than all available outcomes information leads to suboptimal data usage when developing optimal ITRs. Methods To address this limitation, we propose a Bayesian multivariate hierarchical model that leverages the wealth of correlated health outcomes collected in clinical trials. The approach jointly models mixed types of correlated outcomes, facilitating the "borrowing of information" across the multivariate outcomes, and results in a more accurate estimation of heterogeneous treatment effects compared to using single regression models for each outcome. We develop a treatment benefit index, which quantifies the relative treatment benefit of the experimental treatment over the control treatment, based on the proposed multivariate outcome model. Results We demonstrate the strengths of the proposed approach through extensive simulations and an application to an international Coronavirus Disease 2019 (COVID-19) treatment trial. Simulation results indicate that the proposed method reduces the occurrence of erroneous treatment decisions compared to a single regression model for a single health outcome. Additionally, the sensitivity analysis demonstrates the robustness of the model across various study scenarios. Application of the method to the COVID-19 trial exhibits improvements in estimating the individual-level treatment efficacy (indicated by narrower credible intervals for odds ratios) and optimal ITRs. Conclusion The study jointly models mixed types of outcomes in the context of developing ITRs. By considering multiple health outcomes, the proposed approach can advance the development of more effective and reliable personalized treatment.
Collapse
Affiliation(s)
- Danni Wu
- Department of Population Health, New York University Grossman School of Medicine, 180 Madison Avenue, New York, 10016, New York, USA
| | - Keith S. Goldfeld
- Department of Population Health, New York University Grossman School of Medicine, 180 Madison Avenue, New York, 10016, New York, USA
| | - Eva Petkova
- Department of Population Health, New York University Grossman School of Medicine, 180 Madison Avenue, New York, 10016, New York, USA
| | - Hyung G. Park
- Department of Population Health, New York University Grossman School of Medicine, 180 Madison Avenue, New York, 10016, New York, USA
| |
Collapse
|
19
|
Moodie EEM, Talbot D. On "Reflections on the concept of optimality of single decision point treatment regimes". Biom J 2023; 65:e2300027. [PMID: 37797173 DOI: 10.1002/bimj.202300027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 04/26/2023] [Accepted: 06/22/2023] [Indexed: 10/07/2023]
Abstract
This is a discussion of "Reflections on the concept of optimality of single decision point treatment regimes" by Trung Dung Tran, Ariel Alonso Abad, Geert Verbeke, Geert Molenberghs, and Iven Van Mechelen. The authors propose a thoughtful consideration of optimization targets and the implications of such targets for the resulting optimal treatment rule. However, we contest the assertation that targets of optimization have been overlooked and suggest additional considerations that researchers must contemplate as part of a complete framework for learning about optimal treatment regimes.
Collapse
Affiliation(s)
- Erica E M Moodie
- Department of Epidemiology & Biostatistics, McGill University, Montreal, Quebec, Canada
| | - Denis Talbot
- Department of Social and Preventive Medicine, Université Laval, Quebec, Canada
| |
Collapse
|
20
|
Suk Y, Park C. Designing Optimal, Data-Driven Policies from Multisite Randomized Trials. PSYCHOMETRIKA 2023; 88:1171-1196. [PMID: 37874510 DOI: 10.1007/s11336-023-09937-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Indexed: 10/25/2023]
Abstract
Optimal treatment regimes (OTRs) have been widely employed in computer science and personalized medicine to provide data-driven, optimal recommendations to individuals. However, previous research on OTRs has primarily focused on settings that are independent and identically distributed, with little attention given to the unique characteristics of educational settings, where students are nested within schools and there are hierarchical dependencies. The goal of this study is to propose a framework for designing OTRs from multisite randomized trials, a commonly used experimental design in education and psychology to evaluate educational programs. We investigate modifications to popular OTR methods, specifically Q-learning and weighting methods, in order to improve their performance in multisite randomized trials. A total of 12 modifications, 6 for Q-learning and 6 for weighting, are proposed by utilizing different multilevel models, moderators, and augmentations. Simulation studies reveal that all Q-learning modifications improve performance in multisite randomized trials and the modifications that incorporate random treatment effects show the most promise in handling cluster-level moderators. Among weighting methods, the modification that incorporates cluster dummies into moderator variables and augmentation terms performs best across simulation conditions. The proposed modifications are demonstrated through an application to estimate an OTR of conditional cash transfer programs using a multisite randomized trial in Colombia to maximize educational attainment.
Collapse
Affiliation(s)
- Youmi Suk
- Department of Human Development, Teachers College, Columbia University, 525 West 120th Street, New York, NY, 10027, USA.
| | - Chan Park
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, 265 South 37th Street, Philadelphia, USA
| |
Collapse
|
21
|
Liu Z, Zhan Z, Lin C, Zhang B. Estimation in optimal treatment regimes based on mean residual lifetimes with right-censored data. Biom J 2023; 65:e2200340. [PMID: 37789592 DOI: 10.1002/bimj.202200340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 07/06/2023] [Accepted: 07/09/2023] [Indexed: 10/05/2023]
Abstract
An optimal individualized treatment regime (ITR) is a decision rule in allocating the best treatment to each patient and, hence, maximizing overall benefits. In this paper, we propose a novel framework based on nonparametric inverse probability weighting (IPW) and augmented inverse probability weighting (AIPW) estimators of the value function when the data are subject to right censoring. In contrast to most existing approaches that are designed to maximize the expected survival time under a binary treatment framework, the proposed method targets maximizing the mean residual lifetime of patients. Specifically, the proposed IPW method searches the optimal ITR by maximizing an estimator for the overall population outcome directly, without specifying the regression model for the conditional mean residual lifetime, whereas the AIPW method integrates the model information of the mean residual lifetime to improve the robustness. Furthermore, to overcome the computational difficulty in a nonsmooth value estimator, smoothed IPW and AIPW estimators are constructed. In theory, we establish the asymptotic properties of the proposed method under suitable regularity conditions. The empirical performances of the proposed IPW and AIPW estimators are evaluated using simulation studies and are further illustrated with an application to the real-world data set from the Acquired Immunodeficiency Syndrome Clinical Trial Group Protocol 175 (ACTG175).
Collapse
Affiliation(s)
- Zhishuai Liu
- Department of Biostatistics & Bioinformatics, Duke University, Durham, USA
| | - Zishu Zhan
- Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou, China
| | - Cunjie Lin
- Center for Applied Statistics and School of Statistics, Renmin University of China, Beijing, China
| | - Baqun Zhang
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| |
Collapse
|
22
|
Bakoyannis G. Estimating optimal individualized treatment rules with multistate processes. Biometrics 2023; 79:2830-2842. [PMID: 37015010 PMCID: PMC10553793 DOI: 10.1111/biom.13864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 03/23/2023] [Indexed: 04/06/2023]
Abstract
Multistate process data are common in studies of chronic diseases such as cancer. These data are ideal for precision medicine purposes as they can be leveraged to improve more refined health outcomes, compared to standard survival outcomes, as well as incorporate patient preferences regarding quantity versus quality of life. However, there are currently no methods for the estimation of optimal individualized treatment rules with such data. In this paper, we propose a nonparametric outcome weighted learning approach for this problem in randomized clinical trial settings. The theoretical properties of the proposed methods, including Fisher consistency and asymptotic normality of the estimated expected outcome under the estimated optimal individualized treatment rule, are rigorously established. A consistent closed-form variance estimator is provided and methodology for the calculation of simultaneous confidence intervals is proposed. Simulation studies show that the proposed methodology and inference procedures work well even with small-sample sizes and high rates of right censoring. The methodology is illustrated using data from a randomized clinical trial on the treatment of metastatic squamous-cell carcinoma of the head and neck.
Collapse
Affiliation(s)
- Giorgos Bakoyannis
- Department of Biostatistics and Health Data Science, Indiana University, Indianapolis, Indiana, USA
| |
Collapse
|
23
|
Liang M, Yu M. Relative contrast estimation and inference for treatment recommendation. Biometrics 2023; 79:2920-2932. [PMID: 36645310 DOI: 10.1111/biom.13826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 12/29/2022] [Indexed: 01/17/2023]
Abstract
When there are resource constraints, it may be necessary to rank individualized treatment benefits to facilitate the prioritization of assigning different treatments. Most existing literature on individualized treatment rules targets absolute conditional treatment effect differences as a metric for the benefit. However, there can be settings where relative differences may better represent such benefit. In this paper, we consider modeling such relative differences formed as scale-invariant contrasts between the conditional treatment effects. By showing that all scale-invariant contrasts are monotonic transformations of each other, we posit a single index model for a particular relative contrast. We then characterize semiparametric estimating equations, including the efficient score, to estimate index parameters. To achieve semiparametric efficiency, we propose a two-step approach that minimizes a doubly robust loss function for initial estimation and then performs a one-step efficiency augmentation procedure. Careful theoretical and numerical studies are provided to show the superiority of our proposed approach.
Collapse
Affiliation(s)
- Muxuan Liang
- Department of Biostatistics, University of Florida, Gainesville, Florida, USA
| | - Menggang Yu
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin, USA
| |
Collapse
|
24
|
Tran TD, Abad AA, Verbeke G, Molenberghs G, Van Mechelen I. Reflections on the concept of optimality of single decision point treatment regimes. Biom J 2023; 65:e2200285. [PMID: 37736675 DOI: 10.1002/bimj.202200285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 06/28/2023] [Accepted: 07/30/2023] [Indexed: 09/23/2023]
Abstract
In many areas, applied researchers as well as practitioners have to choose between different solutions for a problem at hand; this calls for optimal decision rules to settle the choices involved. As a key example, one may think of the search for optimal treatment regimes (OTRs) in clinical research, that specify which treatment alternative should be administered to each patient under study. Motivated by the fact that the concept of optimality of decision rules in general and treatment regimes in particular has received so far relatively little attention and discussion, we will present a number of reflections on it, starting from the basics of any optimization problem. Specifically, we will analyze the search space and the to be optimized criterion function underlying the search of single decision point OTRs, along with the many choice aspects that show up in their specification. Special attention is paid to formal characteristics and properties as well as to substantive concerns and hypotheses that may guide these choices. We illustrate with a few empirical examples taken from the literature. Finally, we discuss how the presented reflections may help sharpen statistical thinking about optimality of decision rules for treatment assignment and to facilitate the dialogue between the statistical consultant and the applied researcher in search of an OTR.
Collapse
Affiliation(s)
- Trung Dung Tran
- Quantitative Psychology and Individual Differences, KU Leuven, Leuven, Belgium
| | | | - Geert Verbeke
- I-BioStat, KU Leuven, Leuven, Belgium
- I-BioStat, Universiteit Hasselt, Hasselt, Belgium
| | - Geert Molenberghs
- I-BioStat, KU Leuven, Leuven, Belgium
- I-BioStat, Universiteit Hasselt, Hasselt, Belgium
| | - Iven Van Mechelen
- Quantitative Psychology and Individual Differences, KU Leuven, Leuven, Belgium
| |
Collapse
|
25
|
Zhang J, Zhang P, Ma J, Shentu Y. Covariate-adjusted value-guided subgroup identification via boosting. J Biopharm Stat 2023:1-18. [PMID: 37955423 DOI: 10.1080/10543406.2023.2275757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Accepted: 10/22/2023] [Indexed: 11/14/2023]
Abstract
It is widely recognized that treatment effects could differ across subgroups of patients. Subgroup analysis, which assesses such heterogeneity, provides valuable information in developing personalized therapies. There has been extensive research developing novel statistical methods for subgroup identification. The recent contribution is a value-guided subgroup identification method that directly maximizes treatment benefit at the subgroup level for survival outcome, rather than relying on individual treatment effect estimation. In this paper, we first completed this framework by illustrating its application to continuous and binary outcomes. More importantly, we extended the original framework to account for the prognostic effects and named this new method Covariate-Adjusted Value-guided subgroup identification via boosting (CAVboost). The original method directly used the outcome to formulate the value function for subgroup identification. Since the outcome can further be decomposed as prognostic effects and treatment effects, specifying the prognostic effects as the covariates of a model for the outcome can single out the treatment effects and improve the power to detect them across subgroups. Our proposed CAVboost was based on this key idea. It used a covariate-adjusted treatment effect estimator, instead of the outcome itself, to formulate the value function for subgroup identification. CAVboost estimates the treatment effect by using covariates to account for the prognostic effects, which mimics the idea of using covariates in an ANCOVA estimator. We showed that CAVboost could effectively improve the subgroup identification capability for both continuous and binary outcomes.
Collapse
Affiliation(s)
| | - Pingye Zhang
- Gilead Sciences Inc, Foster City, California, USA
| | - Junshui Ma
- Merck & Co. MRL, BARDS, Rahway, New Jersey, USA
| | - Yue Shentu
- Merck & Co. MRL, BARDS, Rahway, New Jersey, USA
| |
Collapse
|
26
|
Rodriguez Duque D, Moodie EEM, Stephens DA. Bayesian inference for optimal dynamic treatment regimes in practice. Int J Biostat 2023; 19:309-331. [PMID: 37192544 DOI: 10.1515/ijb-2022-0073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 03/21/2023] [Indexed: 05/18/2023]
Abstract
In this work, we examine recently developed methods for Bayesian inference of optimal dynamic treatment regimes (DTRs). DTRs are a set of treatment decision rules aimed at tailoring patient care to patient-specific characteristics, thereby falling within the realm of precision medicine. In this field, researchers seek to tailor therapy with the intention of improving health outcomes; therefore, they are most interested in identifying optimal DTRs. Recent work has developed Bayesian methods for identifying optimal DTRs in a family indexed by ψ via Bayesian dynamic marginal structural models (MSMs) (Rodriguez Duque D, Stephens DA, Moodie EEM, Klein MB. Semiparametric Bayesian inference for dynamic treatment regimes via dynamic regime marginal structural models. Biostatistics; 2022. (In Press)); we review the proposed estimation procedure and illustrate its use via the new BayesDTR R package. Although methods in Rodriguez Duque D, Stephens DA, Moodie EEM, Klein MB. (Semiparametric Bayesian inference for dynamic treatment regimes via dynamic regime marginal structural models. Biostatistics; 2022. (In Press)) can estimate optimal DTRs well, they may lead to biased estimators when the model for the expected outcome if everyone in a population were to follow a given treatment strategy, known as a value function, is misspecified or when a grid search for the optimum is employed. We describe recent work that uses a Gaussian process ( G P ) prior on the value function as a means to robustly identify optimal DTRs (Rodriguez Duque D, Stephens DA, Moodie EEM. Estimation of optimal dynamic treatment regimes using Gaussian processes; 2022. Available from: https://doi.org/10.48550/arXiv.2105.12259). We demonstrate how a G P approach may be implemented with the BayesDTR package and contrast it with other value-search approaches to identifying optimal DTRs. We use data from an HIV therapeutic trial in order to illustrate a standard analysis with these methods, using both the original observed trial data and an additional simulated component to showcase a longitudinal (two-stage DTR) analysis.
Collapse
Affiliation(s)
| | - Erica E M Moodie
- Department of Epidemiology & Biostatistics, McGill University, Montréal, QC, Canada
| | - David A Stephens
- Department of Mathematics and Statistics, McGill University, Montréal, QC, Canada
| |
Collapse
|
27
|
Johnson D, Lu W, Davidian M. A general framework for subgroup detection via one-step value difference estimation. Biometrics 2023; 79:2116-2126. [PMID: 35793474 PMCID: PMC10694635 DOI: 10.1111/biom.13711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 06/15/2022] [Indexed: 11/29/2022]
Abstract
Recent statistical methodology for precision medicine has focused on either identification of subgroups with enhanced treatment effects or estimating optimal treatment decision rules so that treatment is allocated in a way that maximizes, on average, predefined patient outcomes. Less attention has been given to subgroup testing, which involves evaluation of whether at least a subgroup of the population benefits from an investigative treatment, compared to some control or standard of care. In this work, we propose a general framework for testing for the existence of a subgroup with enhanced treatment effects based on the difference of the estimated value functions under an estimated optimal treatment regime and a fixed regime that assigns everyone to the same treatment. Our proposed test does not require specification of the parametric form of the subgroup and allows heterogeneous treatment effects within the subgroup. The test applies to cases when the outcome of interest is either a time-to-event or a (uncensored) scalar, and is valid at the exceptional law. To demonstrate the empirical performance of the proposed test, we study the type I error and power of the test statistics in simulations and also apply our test to data from a Phase III trial in patients with hematological malignancies.
Collapse
Affiliation(s)
- Dana Johnson
- United Therapeutics Corp., Research Triangle Park, Durham, North Carolina, USA
| | - Wenbin Lu
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Marie Davidian
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
28
|
Tang S, Mao S, Chen Y, Tan F, Duan L, Pian C, Zeng X. LRBmat: A novel gut microbial interaction and individual heterogeneity inference method for colorectal cancer. J Theor Biol 2023; 571:111538. [PMID: 37257720 DOI: 10.1016/j.jtbi.2023.111538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 05/07/2023] [Accepted: 05/18/2023] [Indexed: 06/02/2023]
Abstract
The gut microbial community has been shown to play a significant role in various diseases, including colorectal cancer (CRC), which is a major public health concern worldwide. The accurate diagnosis and etiological analysis of CRC are crucial issues. Numerous methods have utilized gut microbiota to address these challenges; however, few have considered the complex interactions and individual heterogeneity of the gut microbiota, which are important issues in genetics and intestinal microbiology, particularly in high-dimensional cases. This paper presents a novel method called Binary matrix based on Logistic Regression (LRBmat) to address these concerns. The binary matrix in LRBmat can directly mitigate or eliminate the influence of heterogeneity, while also capturing information on gut microbial interactions with any order. LRBmat is highly adaptable and can be combined with any machine learning method to enhance its capabilities. The proposed method was evaluated using real CRC data and demonstrated superior classification performance compared to state-of-the-art methods. Furthermore, the association rules extracted from the binary matrix of the real data align well with biological properties and existing literature, thereby aiding in the etiological analysis of CRC.
Collapse
Affiliation(s)
- Shan Tang
- Department of Statistics, Hunan University, Changsha 410006, China
| | - Shanjun Mao
- Department of Statistics, Hunan University, Changsha 410006, China.
| | - Yangyang Chen
- Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan
| | - Falong Tan
- Department of Statistics, Hunan University, Changsha 410006, China
| | - Lihua Duan
- Department of Rheumatology and Clinical Immunology, Jiangxi Provincial People's Hospital, The First Affiliated Hospital of Nanchang University, Nanchang 330006, China
| | - Cong Pian
- College of Sciences, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiangxiang Zeng
- Department of Computer Science, Hunan University, Changsha 410086, China
| |
Collapse
|
29
|
Sun W, Liu J, Hu J, Jin J, Siasoco K, Zhou R, Mccoy R. Adaptive restraint design for a diverse population through machine learning. Front Public Health 2023; 11:1202970. [PMID: 37637800 PMCID: PMC10448517 DOI: 10.3389/fpubh.2023.1202970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Accepted: 07/27/2023] [Indexed: 08/29/2023] Open
Abstract
Objective Using population-based simulations and machine-learning algorithms to develop an adaptive restraint system that accounts for occupant anthropometry variations to further enhance safety balance throughout the whole population. Methods Two thousand MADYMO full frontal impact crash simulations at 35 mph using two validated vehicle/restraint models representing a sedan and an SUV along with a parametric occupant model were conducted based on the maximal projection design of experiments, which considers varying occupant covariates (sex, stature, and body mass index) and vehicle restraint design variables (three for airbag, three for safety belt, and one for knee bolster). A Gaussian-process-based surrogate model was trained to rapidly predict occupant injury risks and the associated uncertainties. An optimization framework was formulated to seek the optimal adaptive restraint design policy that minimizes the population injury risk across a wide range of occupant sizes and shapes while maintaining a low difference in injury risks among different occupant subgroups. The effectiveness of the proposed method was tested by comparing the population-wise injury risks under the adaptive design policy and the traditional state-of-the-art design. Results Compared to the traditional state-of-the-art design for midsize males, the optimal design policy shows the potential to further reduce the joint injury risk (combining head, chest, and lower extremity injury risks) among the whole population in the sedan and SUV models. Specifically, the two subgroups of vulnerable occupants including tall obese males and short obese females had higher reductions in injury risks. Conclusions This study lays out a method to adaptively adjust vehicle restraint systems to improve safety balance. This is the first study where population-based crash simulations and machine-learning methods are used to optimize adaptive restraint designs for a diverse population. Nevertheless, this study shows the high injury risks associated with obese and female occupants, which can be mitigated via restraint adaptability.
Collapse
Affiliation(s)
- Wenbo Sun
- University of Michigan Transportation Research Institute (UMTRI), College of Engineering, University of Michigan, Ann Arbor, MI, United States
| | - Jiacheng Liu
- Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, MI, United States
| | - Jingwen Hu
- University of Michigan Transportation Research Institute (UMTRI), College of Engineering, University of Michigan, Ann Arbor, MI, United States
| | - Judy Jin
- Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, MI, United States
| | | | | | | |
Collapse
|
30
|
Rodriguez Duque D, Stephens DA, Moodie EEM, Klein MB. Semiparametric Bayesian inference for optimal dynamic treatment regimes via dynamic marginal structural models. Biostatistics 2023; 24:708-727. [PMID: 35385100 DOI: 10.1093/biostatistics/kxac007] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 01/27/2022] [Accepted: 02/04/2022] [Indexed: 07/20/2023] Open
Abstract
Considerable statistical work done on dynamic treatment regimes (DTRs) is in the frequentist paradigm, but Bayesian methods may have much to offer in this setting as they allow for the appropriate representation and propagation of uncertainty, including at the individual level. In this work, we extend the use of recently developed Bayesian methods for Marginal Structural Models to arrive at inference of DTRs. We do this (i) by linking the observational world with a world in which all patients are randomized to a DTR, thereby allowing for causal inference and then (ii) by maximizing a posterior predictive utility, where the posterior distribution has been obtained from nonparametric prior assumptions on the observational world data-generating process. Our approach relies on Bayesian semiparametric inference, where inference about a finite-dimensional parameter is made all while working within an infinite-dimensional space of distributions. We further study Bayesian inference of DTRs in the double robust setting by using posterior predictive inference and the nonparametric Bayesian bootstrap. The proposed methods allow for uncertainty quantification at the individual level, thereby enabling personalized decision-making. We examine the performance of these methods via simulation and demonstrate their utility by exploring whether to adapt HIV therapy to a measure of patients' liver health, in order to minimize liver scarring.
Collapse
Affiliation(s)
- Daniel Rodriguez Duque
- Department of Epidemiology, Biostatistics, and Occupational Health, 2001 McGill College Avenue, Suite 1200 Montreal, QC, H3A 1G1, Canada
| | - David A Stephens
- Department of Mathematics and Statistics, McGill University, Burnside Hall, 805 Sherbrooke Street West Montreal, QC, H3A 0B9, Canada
| | - Erica E M Moodie
- Department of Epidemiology, Biostatistics, and Occupational Health, 2001 McGill College Avenue, Suite 1200 Montreal, QC, H3A 1G1, Canada
| | - Marina B Klein
- Division of Infectious Diseases and Chronic Viral Illness Service, Department of Medicine, McGill University Health Centre, Montreal, QC, H4A 3J1, Canada
| |
Collapse
|
31
|
Park HG, Wu D, Petkova E, Tarpey T, Ogden RT. Bayesian Index Models for Heterogeneous Treatment Effects on a Binary Outcome. STATISTICS IN BIOSCIENCES 2023; 15:397-418. [PMID: 37313546 PMCID: PMC10197073 DOI: 10.1007/s12561-023-09370-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 03/02/2023] [Accepted: 03/21/2023] [Indexed: 06/15/2023]
Abstract
This paper develops a Bayesian model with a flexible link function connecting a binary treatment response to a linear combination of covariates and a treatment indicator and the interaction between the two. Generalized linear models allowing data-driven link functions are often called "single-index models" and are among popular semi-parametric modeling methods. In this paper, we focus on modeling heterogeneous treatment effects, with the goal of developing a treatment benefit index (TBI) incorporating prior information from historical data. The model makes inference on a composite moderator of treatment effects, summarizing the effect of the predictors within a single variable through a linear projection of the predictors. This treatment benefit index can be useful for stratifying patients according to their predicted treatment benefit levels and can be especially useful for precision health applications. The proposed method is applied to a COVID-19 treatment study.
Collapse
Affiliation(s)
- Hyung G. Park
- Division of Biostatistics, Department of Population Health, New York University School of Medicine, New York, NY 10016 USA
| | - Danni Wu
- Division of Biostatistics, Department of Population Health, New York University School of Medicine, New York, NY 10016 USA
| | - Eva Petkova
- Division of Biostatistics, Department of Population Health, New York University School of Medicine, New York, NY 10016 USA
| | - Thaddeus Tarpey
- Division of Biostatistics, Department of Population Health, New York University School of Medicine, New York, NY 10016 USA
| | - R. Todd Ogden
- Department of Biostatistics, Columbia University, New York, NY 10032 USA
| |
Collapse
|
32
|
Zhou Y, Song PXK. Longitudinal self-learning of individualized treatment rules in a nutrient supplementation trial with missing data. Stat Med 2023. [PMID: 37158137 DOI: 10.1002/sim.9766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 04/24/2023] [Accepted: 04/29/2023] [Indexed: 05/10/2023]
Abstract
Longitudinal outcomes are prevalent in clinical studies, where the presence of missing data may make the statistical learning of individualized treatment rules (ITRs) a much more challenging task. We analyzed a longitudinal calcium supplementation trial in the ELEMENT Project and established a novel ITR to reduce the risk of adverse outcomes of lead exposure on child growth and development. Lead exposure, particularly in the form of in utero exposure, can seriously impair children's health, especially their cognitive and neurobehavioral development, which necessitates clinical interventions such as calcium supplementation intake during pregnancy. Using the longitudinal outcomes from a randomized clinical trial of calcium supplementation, we developed a new ITR for daily calcium intake during pregnancy to mitigate persistent lead exposure in children at age 3 years. To overcome the technical challenges posed by missing data, we illustrate a new learning approach, termed longitudinal self-learning (LS-learning), that utilizes longitudinal measurements of child's blood lead concentration in the derivation of ITR. Our LS-learning method relies on a temporally weighted self-learning paradigm to synergize serially correlated training data sources. The resulting ITR is the first of this kind in precision nutrition that will contribute to the reduction of expected blood lead concentration in children aged 0-3 years should this ITR be implemented to the entire study population of pregnant women.
Collapse
Affiliation(s)
- Yiwang Zhou
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Peter X K Song
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
33
|
Lou J, Wang Y, Li L, Zeng D. Estimating individualized treatment rules for multicategory type 2 diabetes treatments using electronic health records. STATISTICS AND ITS INTERFACE 2023; 16:505-515. [PMID: 38344146 PMCID: PMC10857856 DOI: 10.4310/22-sii739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/15/2024]
Abstract
In this article, we propose a general framework to learn optimal treatment rules for type 2 diabetes (T2D) patients using electronic health records (EHRs). We first propose a joint modeling approach to characterize patient's pretreatment conditions using longitudinal markers from EHRs. The estimation accounts for informative measurement times using inverse-intensity weighting methods. The predicted latent processes in the joint model are used to divide patients into a finite of subgroups and, within each group, patients share similar health profiles in EHRs. Within each patient group, we estimate optimal individualized treatment rules by extending a matched learning method to handle multicategory treatments using a one-versus-one approach. Each matched learning for two treatments is implemented by a weighted support vector machine with matched pairs of patients. We apply our method to estimate optimal treatment rules for T2D patients in a large sample of EHRs from the Ohio State University Wexner Medical Center. We demonstrate the utility of our method to select the optimal treatments from four classes of drugs and achieve a better control of glycated hemoglobin than any one-size-fits-all rules.
Collapse
Affiliation(s)
- Jitong Lou
- 135 Dauer Drive, 3101 McGavran-Greenberg Hall, Chapel Hill, NC 27599, USA
| | - Yuanjia Wang
- 722 West 168th Street, Rm 210, New York, NY 10032, USA
| | - Lang Li
- 250 Lincoln Tower, 1800 Cannon Drive, Columbus, OH 43210, USA
| | - Donglin Zeng
- 135 Dauer Drive, 3103B McGavran-Greenberg Hall, Chapel Hill, NC 27599, USA
| |
Collapse
|
34
|
Orwitz N, Tarpey T, Petkova E. Confidence in the treatment decision for an individual patient: strategies for sequential assessment. STATISTICS AND ITS INTERFACE 2023; 16:475-491. [PMID: 37274458 PMCID: PMC10238081 DOI: 10.4310/22-sii737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Evolving medical technologies have motivated the development of treatment decision rules (TDRs) that incorporate complex, costly data (e.g., imaging). In clinical practice, we aim for TDRs to be valuable by reducing unnecessary testing while still identifying the best possible treatment for a patient. Regardless of how well any TDR performs in the target population, there is an associated degree of uncertainty about its optimality for a specific patient. In this paper, we aim to quantify, via a confidence measure, the uncertainty in a TDR as patient data from sequential procedures accumulate in real-time. We first propose estimating confidence using the distance of a patient's vector of covariates to a treatment decision boundary, with further distances corresponding to higher certainty. We further propose measuring confidence through the conditional probabilities of ultimately (with all possible information available) being assigned a particular treatment, given that the same treatment is assigned with the patient's currently available data or given the treatment recommendation made using only the currently available patient data. As patient data accumulate, the treatment decision is updated and confidence reassessed until a sufficiently high confidence level is achieved. We present results from simulation studies and illustrate the methods using a motivating example from a depression clinical trial. Recommendations for practical use of the measures are proposed.
Collapse
Affiliation(s)
- Nina Orwitz
- 180 Madison Avenue, New York, NY 10016, United States of America
| | - Thaddeus Tarpey
- 180 Madison Avenue, New York, NY 10016, United States of America
| | - Eva Petkova
- 180 Madison Avenue, New York, NY 10016, United States of America
| |
Collapse
|
35
|
Ghosh P, Yan X, Chakraborty B. A novel approach to assess dynamic treatment regimes embedded in a SMART with an ordinal outcome. Stat Med 2023; 42:1096-1111. [PMID: 36726310 DOI: 10.1002/sim.9659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 10/21/2022] [Accepted: 01/04/2023] [Indexed: 02/03/2023]
Abstract
Sequential multiple assignment randomized trials (SMARTs) are used to construct data-driven optimal intervention strategies for subjects based on their intervention and covariate histories in different branches of health and behavioral sciences where a sequence of interventions is given to a participant. Sequential intervention strategies are often called dynamic treatment regimes (DTR). In the existing literature, the majority of the analysis methodologies for SMART data assume a continuous primary outcome. However, ordinal outcomes are also quite common in clinical practice. In this work, first, we introduce the notion of generalized odds ratio ( G O R $$ GOR $$ ) to compare two DTRs embedded in a SMART with an ordinal outcome and discuss some combinatorial properties of this measure. Next, we propose a likelihood-based approach to estimate G O R $$ GOR $$ from SMART data, and derive the asymptotic properties of its estimate. We discuss alternative ways to estimate G O R $$ GOR $$ using concordant-discordant pairs and two-sample U $$ U $$ -statistic. We derive the required sample size formula for designing SMARTs with ordinal outcomes based on G O R $$ GOR $$ . A simulation study shows the performance of the estimated G O R $$ GOR $$ in terms of the estimated power corresponding to the derived sample size. The methodology is applied to analyze data from the SMART+ study, conducted in the UK, to improve carbohydrate periodization behavior in athletes using a menu planner mobile application, Hexis Performance. A freely available Shiny web app using R is provided to make the proposed methodology accessible to other researchers and practitioners.
Collapse
Affiliation(s)
- Palash Ghosh
- Department of Mathematics, Indian Institute of Technology Guwahati, Assam, India.,Jyoti and Bhupat Mehta School of Health Sciences and Technology, Indian Institute of Technology Guwahati, Assam, India.,Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Xiaoxi Yan
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Bibhas Chakraborty
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore.,Department of Statistics and Data Science, National University of Singapore, Singapore, Singapore.,Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina
| |
Collapse
|
36
|
Park H, Petkova E, Tarpey T, Ogden RT. Functional additive models for optimizing individualized treatment rules. Biometrics 2023; 79:113-126. [PMID: 34704622 PMCID: PMC9043034 DOI: 10.1111/biom.13586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 10/01/2021] [Accepted: 10/14/2021] [Indexed: 11/29/2022]
Abstract
A novel functional additive model is proposed, which is uniquely modified and constrained to model nonlinear interactions between a treatment indicator and a potentially large number of functional and/or scalar pretreatment covariates. The primary motivation for this approach is to optimize individualized treatment rules based on data from a randomized clinical trial. We generalize functional additive regression models by incorporating treatment-specific components into additive effect components. A structural constraint is imposed on the treatment-specific components in order to provide a class of additive models with main effects and interaction effects that are orthogonal to each other. If primary interest is in the interaction between treatment and the covariates, as is generally the case when optimizing individualized treatment rules, we can thereby circumvent the need to estimate the main effects of the covariates, obviating the need to specify their form and thus avoiding the issue of model misspecification. The methods are illustrated with data from a depression clinical trial with electroencephalogram functional data as patients' pretreatment covariates.
Collapse
Affiliation(s)
- Hyung Park
- Division of Biostatistics, Department of Population Health, New York University, New York, NY 10016, USA
| | - Eva Petkova
- Division of Biostatistics, Department of Population Health, New York University, New York, NY 10016, USA
| | - Thaddeus Tarpey
- Division of Biostatistics, Department of Population Health, New York University, New York, NY 10016, USA
| | - R. Todd Ogden
- Department of Biostatistics, Columbia University, New York, NY 10032, USA
| |
Collapse
|
37
|
Li Z, Chen J, Laber E, Liu F, Baumgartner R. Optimal Treatment Regimes: A Review and Empirical Comparison. Int Stat Rev 2023. [DOI: 10.1111/insr.12536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Affiliation(s)
- Zhen Li
- Department of Statistics North Carolina State University Raleigh 27607 NC USA
| | - Jie Chen
- Department of Biometrics Overland Pharmaceuticals Dover 19901 DE USA
| | - Eric Laber
- Department of Statistical Science, Department of Biostatistics and Bioinformatics Duke University Durham 27708 NC USA
| | - Fang Liu
- Biostatistics and Research Decision Sciences Merck & Co., Inc. Kenilworth NJ 07033 USA
| | - Richard Baumgartner
- Biostatistics and Research Decision Sciences Merck & Co., Inc. Kenilworth NJ 07033 USA
| |
Collapse
|
38
|
Li C, Li W, Zhu W. Penalized robust learning for optimal treatment regimes with heterogeneous individualized treatment effects. J Appl Stat 2023; 51:1151-1170. [PMID: 38628447 PMCID: PMC11018073 DOI: 10.1080/02664763.2023.2180167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 02/05/2023] [Indexed: 02/22/2023]
Abstract
The growing popularity of personalized medicine motivates people to explore individualized treatment regimes according to heterogeneous characteristics of the patients. For the large-scale data analysis, however, the data are collected at different times and different locations, i.e. subjects are usually from a heterogeneous population, which causes that the optimal treatment regimes also vary for patients across different subgroups. In this paper, we mainly focus on the estimation of optimal treatment regimes for subjects come from a heterogeneous population with high-dimensional data. We first remove the main effects of the covariates for each subgroup to eliminate non-ignorable residual confounding. Based on the centralized outcome, we propose a penalized robust learning that estimates the coefficient matrix of the interactions between covariates and treatment by penalizing pairwise differences of the coefficients of any two subgroups for the same covariate, which can automatically identify the latent complex structure of the coefficient matrix with heterogeneous and homogeneous columns. At the same time, the penalized robust learning can also select the important variables that truly contribute to the individualized treatment decisions with commonly used sparsity structure penalty. Extensive simulation studies show that our proposed method outperforms current popular methods, and it is further illustrated in the real analysis of the Tamoxifen breast cancer data.
Collapse
Affiliation(s)
- Canhui Li
- Key Laboratory for Applied Statistics of MOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, People's Republic of China
| | - Weirong Li
- Key Laboratory for Applied Statistics of MOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, People's Republic of China
| | - Wensheng Zhu
- Key Laboratory for Applied Statistics of MOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, People's Republic of China
| |
Collapse
|
39
|
Liu P, Li J, Kosorok MR. Change plane model averaging for subgroup identification. Stat Methods Med Res 2023; 32:773-788. [PMID: 36775991 DOI: 10.1177/09622802231154327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/14/2023]
Abstract
Central to personalized medicine and tailored therapies is discovering the subpopulations that account for treatment effect heterogeneity and are likely to benefit more from given interventions. In this article, we introduce a change plane model averaging method to identify subgroups characterized by linear combinations of predictive variables and multiple cut-offs. We first fit a sequence of statistical models, each incorporating the thresholding effect of one particular covariate. The estimation of submodels is accomplished through an iterative integration of a change point detection method and numerical optimization algorithms. A frequentist model averaging approach is then employed to linearly combine the submodels with optimal weights. Our approach can deal with high-dimensional settings involving enormous potential grouping variables by adopting the sparsity-inducing penalties. Simulation studies are conducted to investigate the prediction and subgrouping performance of the proposed method, with a comparison to various competing subgroup detection methods. Our method is applied to a dataset from a warfarin pharmacogenetics study, producing some new findings.
Collapse
Affiliation(s)
- Pan Liu
- Department of Statistics and Data Science, 37580National University of Singapore, Singapore, Singapore
| | - Jialiang Li
- Department of Statistics and Data Science, 37580National University of Singapore, Singapore, Singapore.,Duke University NUS Graduate Medical School, Singapore, Singapore
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, USA
| |
Collapse
|
40
|
Accountable survival contrast-learning for optimal dynamic treatment regimes. Sci Rep 2023; 13:2250. [PMID: 36755137 PMCID: PMC9908913 DOI: 10.1038/s41598-023-29106-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Accepted: 01/30/2023] [Indexed: 02/10/2023] Open
Abstract
Dynamic treatment regime (DTR) is an emerging paradigm in recent medical studies, which searches a series of decision rules to assign optimal treatments to each patient by taking into account individual features such as genetic, environmental, and social factors. Although there is a large and growing literature on statistical methods to estimate optimal treatment regimes, most methodologies focused on complete data. In this article, we propose an accountable contrast-learning algorithm for optimal dynamic treatment regime with survival endpoints. Our estimating procedure is originated from a doubly-robust weighted classification scheme, which is a model-based contrast-learning method that directly characterizes the interaction terms between predictors and treatments without main effects. To reflect the censorship, we adopt the pseudo-value approach that replaces survival quantities with pseudo-observations for the time-to-event outcome. Unlike many existing approaches, mostly based on complicated outcome regression modeling or inverse-probability weighting schemes, the pseudo-value approach greatly simplifies the estimating procedure for optimal treatment regime by allowing investigators to conveniently apply standard machine learning techniques to censored survival data without losing much efficiency. We further explore a SCAD-penalization to find informative clinical variables and modified algorithms to handle multiple treatment options by searching upper and lower bounds of the objective function. We demonstrate the utility of our proposal via extensive simulations and application to AIDS data.
Collapse
|
41
|
Zhou J, Zhang Y, Tu W. A reference-free R-learner for treatment recommendation. Stat Methods Med Res 2023; 32:404-424. [PMID: 36540907 DOI: 10.1177/09622802221144326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Assigning optimal treatments to individual patients based on their characteristics is the ultimate goal of precision medicine. Deriving evidence-based recommendations from observational data while considering the causal treatment effects and patient heterogeneity is a challenging task, especially in situations of multiple treatment options. Herein, we propose a reference-free R-learner based on a simplex algorithm for treatment recommendation. We showed through extensive simulation that the proposed method produced accurate recommendations that corresponded to optimal treatment outcomes, regardless of the reference group. We used the method to analyze data from the Systolic Blood Pressure Intervention Trial (SPRINT) and achieved recommendations consistent with the current clinical guidelines.
Collapse
Affiliation(s)
- Junyi Zhou
- Design and Inovation, 7129Amgen Inc., Thousand Oaks, CA, USA
| | - Ying Zhang
- Department of Biostatistics, 12284University of Nebraska Medical Center, Omaha, NE, USA
| | - Wanzhu Tu
- Department of Biostatistics and Health Data Science, Indiana University-School of Medicine and Fairbanks School of Public Health, Indianapolis, IN, USA
| |
Collapse
|
42
|
Kahkoska AR, Freeman NLB, Jones EP, Shirazi D, Browder S, Page A, Sperger J, Zikry TM, Yu F, Busby-Whitehead J, Kosorok MR, Batsis JA. Individualized interventions and precision health: Lessons learned from a systematic review and implications for analytics-driven geriatric research. J Am Geriatr Soc 2023; 71:383-393. [PMID: 36524627 PMCID: PMC10037848 DOI: 10.1111/jgs.18141] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 09/16/2022] [Accepted: 10/22/2022] [Indexed: 12/23/2022]
Abstract
Older adults are characterized by profound clinical heterogeneity. When designing and delivering interventions, there exist multiple approaches to account for heterogeneity. We present the results of a systematic review of data-driven, personalized interventions in older adults, which serves as a use case to distinguish the conceptual and methodologic differences between individualized intervention delivery and precision health-derived interventions. We define individualized interventions as those where all participants received the same parent intervention, modified on a case-by-case basis and using an evidence-based protocol, supplemented by clinical judgment as appropriate, while precision health-derived interventions are those that tailor care to individuals whereby the strategy for how to tailor care was determined through data-driven, precision health analytics. We discuss how their integration may offer new opportunities for analytics-based geriatric medicine that accommodates individual heterogeneity but allows for more flexible and resource-efficient population-level scaling.
Collapse
Affiliation(s)
- Anna R. Kahkoska
- Department of Nutrition, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Nikki L. B. Freeman
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Emily P. Jones
- Health Sciences Library, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Daniela Shirazi
- Department of Medicine, California University of Science and Medicine, Colton, California, USA
| | - Sydney Browder
- Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Annie Page
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - John Sperger
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Tarek M. Zikry
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Fei Yu
- School of Information and Library Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Jan Busby-Whitehead
- Division of Geriatric Medicine, Department of Medicine, Center for Aging and Health, School of Medicine, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Michael R. Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States
| | - John A. Batsis
- Department of Nutrition, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
- Division of Geriatric Medicine, Department of Medicine, Center for Aging and Health, School of Medicine, University of North Carolina, Chapel Hill, North Carolina, USA
| |
Collapse
|
43
|
Wang X, Chakraborty B. The Sequential Multiple Assignment Randomized Trial for Controlling Infectious Diseases: A Review of Recent Developments. Am J Public Health 2023; 113:49-59. [PMID: 36516390 PMCID: PMC9755933 DOI: 10.2105/ajph.2022.307135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Infectious diseases have posed severe threats to public health across the world. Effective prevention and control of infectious diseases in the long term requires adapting interventions based on epidemiological evidence. The sequential multiple assignment randomized trial (SMART) is a multistage randomized trial that can provide valid evidence of when and how to adapt interventions for controlling infectious diseases based on evolving epidemiological evidence. We review recent developments in SMARTs to bring wider attention to the potential benefits of employing SMARTs in constructing effective adaptive interventions for controlling infectious diseases and other threats to public health. We discuss 2 example SMARTs for infectious diseases and summarize recent developments in SMARTs from the varied aspects of design, analysis, cost, and ethics. Public health investigators are encouraged to familiarize themselves with the related materials we discuss and collaborate with experts in SMARTs to translate the methodological developments into preeminent public health research. (Am J Public Health. 2023;113(1):49-59. https://doi.org/10.2105/AJPH.2022.307135).
Collapse
Affiliation(s)
- Xinru Wang
- Xinru Wang and Bibhas Chakraborty are with the Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore. Bibhas Chakraborty is also with the Department of Statistics and Data Science, National University of Singapore, Singapore
| | - Bibhas Chakraborty
- Xinru Wang and Bibhas Chakraborty are with the Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore. Bibhas Chakraborty is also with the Department of Statistics and Data Science, National University of Singapore, Singapore
| |
Collapse
|
44
|
Rose EJ, Moodie EEM, Shortreed S. Using Pilot Data for Power Analysis of Observational Studies for the Estimation of Dynamic Treatment Regimes. OBSERVATIONAL STUDIES 2023; 9:25-48. [PMID: 39005256 PMCID: PMC11245299 DOI: 10.1353/obs.2023.a906627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Significant attention has been given to developing data-driven methods for tailoring patient care based on individual patient characteristics. Dynamic treatment regimes formalize this approach through a sequence of decision rules that map patient information to a suggested treatment. The data for estimating and evaluating treatment regimes are ideally gathered through the use of Sequential Multiple Assignment Randomized Trials (SMARTs), though longitudinal observational studies are commonly used due to the potentially prohibitive costs of conducting a SMART. Observational studies are typically powered for simple comparisons of fixed treatment sequences; a priori power or sample size calculations for tailored strategies are rarely if ever undertaken. This has lead to many studies that fail to find a statistically significant benefit to tailoring treatment. We develop power analyses for the estimation of dynamic treatment regimes from observational studies. Our approach uses pilot data to estimate the power for comparing the value of the optimal regime, i.e., the expected outcome if all patients in the population were treated by following the optimal regime, with a known comparison mean. This allows for calculations that ensure a study has sufficient power to detect the need for tailoring, should it be present. Our approach also ensures the value of the estimated optimal treatment regime has a high probability of being within a range of the value of the true optimal regime, set a priori. We examine the performance of the proposed procedure with a simulation study and use it to size a study for reducing depressive symptoms using data from electronic health records.
Collapse
Affiliation(s)
- Eric J Rose
- Department of Epidemiology and Biostatistics, University at Albany, Rensselaer, NY, 12144, USA
- Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montreal, QC, H3A 1G1, Canada
| | - Erica E M Moodie
- Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montreal, QC, H3A 1G1, Canada
| | - Susan Shortreed
- Kaiser Permanente Washington Health Research Institute, Seattle, WA, 98101, USA
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
| |
Collapse
|
45
|
Ma H, Zeng D, Liu Y. Learning Optimal Group-structured Individualized Treatment Rules with Many Treatments. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2023; 24:102. [PMID: 37588020 PMCID: PMC10426767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Data driven individualized decision making problems have received a lot of attentions in recent years. In particular, decision makers aim to determine the optimal Individualized Treatment Rule (ITR) so that the expected specified outcome averaging over heterogeneous patient-specific characteristics is maximized. Many existing methods deal with binary or a moderate number of treatment arms and may not take potential treatment effect structure into account. However, the effectiveness of these methods may deteriorate when the number of treatment arms becomes large. In this article, we propose GRoup Outcome Weighted Learning (GROWL) to estimate the latent structure in the treatment space and the optimal group-structured ITRs through a single optimization. In particular, for estimating group-structured ITRs, we utilize the Reinforced Angle based Multicategory Support Vector Machines (RAMSVM) to learn group-based decision rules under the weighted angle based multi-class classification framework. Fisher consistency, the excess risk bound, and the convergence rate of the value function are established to provide a theoretical guarantee for GROWL. Extensive empirical results in simulation studies and real data analysis demonstrate that GROWL enjoys better performance than several other existing methods.
Collapse
Affiliation(s)
- Haixu Ma
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yufeng Liu
- Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, Carolina Center for Genome Science, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
46
|
Model selection for survival individualized treatment rules using the jackknife estimator. BMC Med Res Methodol 2022; 22:328. [PMID: 36550398 PMCID: PMC9773469 DOI: 10.1186/s12874-022-01811-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 12/01/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Precision medicine is an emerging field that involves the selection of treatments based on patients' individual prognostic data. It is formalized through the identification of individualized treatment rules (ITRs) that maximize a clinical outcome. When the type of outcome is time-to-event, the correct handling of censoring is crucial for estimating reliable optimal ITRs. METHODS We propose a jackknife estimator of the value function to allow for right-censored data for a binary treatment. The jackknife estimator or leave-one-out-cross-validation approach can be used to estimate the value function and select optimal ITRs using existing machine learning methods. We address the issue of censoring in survival data by introducing an inverse probability of censoring weighted (IPCW) adjustment in the expression of the jackknife estimator of the value function. In this paper, we estimate the optimal ITR by using random survival forest (RSF) and Cox proportional hazards model (COX). We use a Z-test to compare the optimal ITRs learned by RSF and COX with the zero-order model (or one-size-fits-all). Through simulation studies, we investigate the asymptotic properties and the performance of our proposed estimator under different censoring rates. We illustrate our proposed method on a phase III clinical trial of non-small cell lung cancer data. RESULTS Our simulations show that COX outperforms RSF for small sample sizes. As sample sizes increase, the performance of RSF improves, in particular when the expected log failure time is not linear in the covariates. The estimator is fairly normally distributed across different combinations of simulation scenarios and censoring rates. When applied to a non-small-cell lung cancer data set, our method determines the zero-order model (ZOM) as the best performing model. This finding highlights the possibility that tailoring may not be needed for this cancer data set. CONCLUSION The jackknife approach for estimating the value function in the presence of right-censored data shows satisfactory performance when there is small to moderate censoring. Winsorizing the upper and lower percentiles of the estimated survival weights for computing the IPCWs stabilizes the estimator.
Collapse
|
47
|
Rudolph KE, Díaz I. When the Ends do not Justify the Means: Learning Who is Predicted to Have Harmful Indirect Effects. JOURNAL OF THE ROYAL STATISTICAL SOCIETY. SERIES A, (STATISTICS IN SOCIETY) 2022; 185:S573-S589. [PMID: 37397280 PMCID: PMC10312488 DOI: 10.1111/rssa.12951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
There is a growing literature on finding rules by which to assign treatment based on an individual's characteristics such that a desired outcome under the intervention is maximized. A related goal entails identifying a subpopulation of individuals predicted to have a harmful indirect effect (the effect of treatment on an outcome through mediators), perhaps even in the presence of a predicted beneficial total treatment effect. In some cases, the implications of a likely harmful indirect effect may outweigh an anticipated beneficial total treatment effect, and would motivate further discussion of whether to treat identified individuals. We build on the mediation and optimal treatment rule literatures to propose a method of identifying a subgroup for which the treatment effect through the mediator is expected to be harmful. Our approach is nonparametric, incorporates post-treatment confounders of the mediator-outcome relationship, and does not make restrictions on the distribution of baseline covariates, mediating variables, or outcomes. We apply the proposed approach to identify a subgroup of boys in the MTO housing voucher experiment who are predicted to have a harmful indirect effect of housing voucher receipt on subsequent psychiatric disorder incidence through aspects of their school and neighborhood environments.
Collapse
Affiliation(s)
- Kara E Rudolph
- Department of Epidemiology, Mailman School of Public Health, Columbia University
| | - Iván Díaz
- Division of Biostatistics, Department of Population Health Sciences, Weill Cornell Medicine
| |
Collapse
|
48
|
Chen B, Yuan A, Qin J. Pool adjacent violators algorithm-assisted learning with application on estimating optimal individualized treatment regimes. Biometrics 2022; 78:1475-1488. [PMID: 34181761 DOI: 10.1111/biom.13511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 05/17/2021] [Accepted: 06/09/2021] [Indexed: 12/30/2022]
Abstract
Personalized medicine allows individuals to choose the best fit of their treatments based on their characteristics through an individualized treatment regime. In this paper, we develop a pool adjacent violators algorithm-assisted learning method to find the optimal individualized treatment regime under the monotone single-index outcome gain model. The proposed estimator is more efficient than peers, and it is robust to the misspecification of the propensity score model or the baseline regression model. The optimal treatment regime is also robust to the misspecification of the functional form of the expected outcome gain model. Simulation studies verified our theoretical results. We also provide an estimate of the expected outcome gain model. Plotting the expected outcome gain versus an individual's characteristics index can visualize how significant the treatment effect is over the control. We apply the proposed method to an AIDS study.
Collapse
Affiliation(s)
- Baojiang Chen
- Department of Biostatistics and Data Science, University of Texas Health Science Center at Houston, School of Public Health in Austin, Austin, Texas
| | - Ao Yuan
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington, District of Columbia
| | - Jing Qin
- National Institute of Allergy and Infectious Diseases, National Institute of Health, Bethesda, Maryland
| |
Collapse
|
49
|
Chen X, Song R, Zhang J, Adams SA, Sun L, Lu W. On estimating optimal regime for treatment initiation time based on restricted mean residual lifetime. Biometrics 2022; 78:1377-1389. [PMID: 34263933 DOI: 10.1111/biom.13530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 05/13/2021] [Accepted: 07/08/2021] [Indexed: 12/30/2022]
Abstract
When to initiate treatment on patients is an important problem in many medical studies such as AIDS and cancer. In this article, we formulate the treatment initiation time problem for time-to-event data and propose an optimal individualized regime that determines the best treatment initiation time for individual patients based on their characteristics. Different from existing optimal treatment regimes where treatments are undertaken at a pre-specified time, here new challenges arise from the complicated missing mechanisms in treatment initiation time data and the continuous treatment rule in terms of initiation time. To tackle these challenges, we propose to use restricted mean residual lifetime as a value function to evaluate the performance of different treatment initiation regimes, and develop a nonparametric estimator for the value function, which is consistent even when treatment initiation times are not completely observable and their distribution is unknown. We also establish the asymptotic properties of the resulting estimator in the decision rule and its associated value function estimator. In particular, the asymptotic distribution of the estimated value function is nonstandard, which follows a weighted chi-squared distribution. The finite-sample performance of the proposed method is evaluated by simulation studies and is further illustrated with an application to a breast cancer data.
Collapse
Affiliation(s)
- Xin Chen
- School of Statistics and Mathematics, Shanghai Lixin University of Accounting and Finance, Shanghai, China.,Interdisciplinary Research Institute of Data Science, Shanghai Lixin University of Accounting and Finance, Shanghai, China
| | - Rui Song
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Jiajia Zhang
- Department of Epidemiology and Biostatistics, University of South Carolina, Columbia, South Carolina, USA
| | - Swann Arp Adams
- Department of Epidemiology and Biostatistics, University of South Carolina, Columbia, South Carolina, USA.,College of Nursing, University of South Carolina, Columbia, South Carolina, USA
| | - Liuquan Sun
- Institute of Applied Mathematics, Chinese Academy of Science, Beijing, China
| | - Wenbin Lu
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
50
|
Yao L, Tarpey T. A Single Index Model for Longitudinal Outcomes to Optimize Individual Treatment Decision Rules. Stat (Int Stat Inst) 2022; 11:e493. [PMID: 38770026 PMCID: PMC11105108 DOI: 10.1002/sta4.493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 08/04/2022] [Indexed: 11/07/2022]
Abstract
A pressing challenge in medical research is to identify optimal treatments for individual patients. This is particularly challenging in mental health settings where mean responses are often similar across multiple treatments. For example, the mean longitudinal trajectories for patients treated with an active drug and placebo may be very similar but different treatments may exhibit distinctly different individual trajectory shapes. Most precision medicine approaches using longitudinal data often ignore information from the longitudinal data structure. This paper investigates a powerful precision medicine approach by examining the impact of baseline covariates on longitudinal outcome trajectories to guide treatment decisions instead of traditional scalar outcome measures derived from longitudinal data, such as a change score. We introduce a method of estimating "biosignatures" defined as linear combinations of baseline characteristics (i.e., a single index) that optimally separate longitudinal trajectories among different treatment groups. The criterion used is to maximize the Kullback-Leibler Divergence between different treatment outcome distributions. The approach is illustrated via simulation studies and a depression clinical trial. The approach is also contrasted with more traditional methods and compares performance in the presence of missing data.
Collapse
Affiliation(s)
- Lanqiu Yao
- Division of Biostatistics, Department of Population Health, NYU Grossman School of Medicine, NY, USA
| | - Thaddeus Tarpey
- Division of Biostatistics, Department of Population Health, NYU Grossman School of Medicine, NY, USA
| |
Collapse
|