1
|
Inoue K, Adomi M, Efthimiou O, Komura T, Omae K, Onishi A, Tsutsumi Y, Fujii T, Kondo N, Furukawa TA. Machine learning approaches to evaluate heterogeneous treatment effects in randomized controlled trials: a scoping review. J Clin Epidemiol 2024; 176:111538. [PMID: 39305940 DOI: 10.1016/j.jclinepi.2024.111538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 09/06/2024] [Accepted: 09/16/2024] [Indexed: 10/20/2024]
Abstract
BACKGROUND AND OBJECTIVES Estimating heterogeneous treatment effects (HTEs) in randomized controlled trials (RCTs) has received substantial attention recently. This has led to the development of several statistical and machine learning (ML) algorithms to assess HTEs through identifying individualized treatment effects. However, a comprehensive review of these algorithms is lacking. We thus aimed to catalog and outline currently available statistical and ML methods for identifying HTEs via effect modeling using clinical RCT data and summarize how they have been applied in practice. STUDY DESIGN AND SETTING We performed a scoping review using prespecified search terms in MEDLINE and Embase, aiming to identify studies that assessed HTEs using advanced statistical and ML methods in RCT data published from 2010 to 2022. RESULTS Among a total of 32 studies identified in the review, 17 studies applied existing algorithms to RCT data, and 15 extended existing algorithms or proposed new algorithms. Applied algorithms included penalized regression, causal forest, Bayesian causal forest, and other metalearner frameworks. Of these methods, causal forest was the most frequently used (7 studies) followed by Bayesian causal forest (4 studies). Most applications were in cardiology (6 studies), followed by psychiatry (4 studies). We provide example R codes in simulated data to illustrate how to implement these algorithms. CONCLUSION This review identified and outlined various algorithms currently used to identify HTEs and individualized treatment effects in RCT data. Given the increasing availability of new algorithms, analysts should carefully select them after examining model performance and considering how the models will be used in practice.
Collapse
Affiliation(s)
- Kosuke Inoue
- Department of Social Epidemiology, Graduate School of Medicine, Kyoto University, Kyoto, Japan; Hakubi Center, Kyoto University, Kyoto, Japan.
| | - Motohiko Adomi
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Orestis Efthimiou
- Institute of Primary Health Care (BIHAM), University of Bern, Bern, Switzerland; Institute of Social and Preventive Medicine (ISPM), University of Bern, Bern, Switzerland
| | - Toshiaki Komura
- Department of Epidemiology, School of Public Health, Boston University, Boston, MA, USA
| | - Kenji Omae
- Department of Innovative Research and Education for Clinicians and Trainees, Fukushima Medical University Hospital, Fukushima, Japan; Center for Innovative Research for Communities and Clinical Excellence, Fukushima Medical University, Fukushima, Japan
| | - Akira Onishi
- Department of Advanced Medicine for Rheumatic Diseases, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Yusuke Tsutsumi
- Human Health Sciences, Kyoto University Graduate School of Medicine, Kyoto, Japan; Department of Emergency Medicine, National Hospital Organization Mito Medical Center, Ibaraki, Japan
| | - Tomoko Fujii
- Intensive Care Unit, Jikei University Hospital, Tokyo, Japan; Departments of Health Promotion and Human Behavior and of Clinical Epidemiology, Kyoto University Graduate School of Medicine/School of Public Health, Kyoto, Japan
| | - Naoki Kondo
- Department of Social Epidemiology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Toshi A Furukawa
- Departments of Health Promotion and Human Behavior and of Clinical Epidemiology, Kyoto University Graduate School of Medicine/School of Public Health, Kyoto, Japan
| |
Collapse
|
2
|
Thangaraj PM, Shankar SV, Huang S, Nadkarni GN, Mortazavi BJ, Oikonomou EK, Khera R. A Novel Digital Twin Strategy to Examine the Implications of Randomized Clinical Trials for Real-World Populations. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.03.25.24304868. [PMID: 38585929 PMCID: PMC10996766 DOI: 10.1101/2024.03.25.24304868] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Randomized clinical trials (RCTs) are essential to guide medical practice; however, their generalizability to a given population is often uncertain. We developed a statistically informed Generative Adversarial Network (GAN) model, RCT-Twin-GAN, that leverages relationships between covariates and outcomes and generates a digital twin of an RCT (RCT-Twin) conditioned on covariate distributions from a second patient population. We used RCT-Twin-GAN to reproduce treatment effect outcomes of the Systolic Blood Pressure Intervention Trial (SPRINT) and the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Blood Pressure Trial, which tested the same intervention but found different treatment effects. To demonstrate treatment effect estimates of each RCT conditioned on the other RCT's patient population, we evaluated the cardiovascular event-free survival of SPRINT digital twins conditioned on the ACCORD cohort and vice versa (ACCORD twins conditioned on SPRINT). The conditioned digital twins were balanced across intervention and control arms (mean absolute standardized mean difference (MASMD) of covariates between treatment arms 0.019 (SD 0.018), and the conditioned covariates of the SPRINT-Twin on ACCORD were more similar to ACCORD than SPRINT (MASMD 0.0082 SD 0.016 vs. 0.46 SD 0.20). Notably, across iterations, SPRINT conditioned ACCORD-Twin datasets reproduced the overall non-significant effect size seen in ACCORD (5-year cardiovascular outcome hazard ratio (95% confidence interval) of 0.88 (0.73-1.06) in ACCORD vs. median 0.87 (0.68-1.13) in the SPRINT conditioned ACCORD-Twin), while the ACCORD conditioned SPRINT-Twins reproduced the significant effect size seen in SPRINT (0.75 (0.64-0.89) vs. median 0.79 (0.72-0.86)) in the ACCORD conditioned SPRINT-Twin). Finally, we demonstrate the translation of this approach to real-world populations by conditioning the trials on an electronic health record population. Therefore, RCT-Twin-GAN simulates the direct translation of RCT-derived treatment effects across various patient populations.
Collapse
Affiliation(s)
- Phyllis M. Thangaraj
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| | - Sumukh Vasisht Shankar
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| | - Sicong Huang
- Department of Computer Science and Engineering, Texas A&M University, College Station, TX
| | - Girish N. Nadkarni
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- The Division of Data Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Bobak J. Mortazavi
- Department of Computer Science and Engineering, Texas A&M University, College Station, TX
| | - Evangelos K. Oikonomou
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| | - Rohan Khera
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
- Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, New Haven, CT
- Section of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, CT
- Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, New Haven, CT, USA
| |
Collapse
|
3
|
Graham Linck EJ, Goligher EC, Semler MW, Churpek MM. Toward Precision in Critical Care Research: Methods for Observational and Interventional Studies. Crit Care Med 2024; 52:1439-1450. [PMID: 39145702 PMCID: PMC11328956 DOI: 10.1097/ccm.0000000000006371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/16/2024]
Abstract
Critical care trials evaluate the effect of interventions in patients with diverse personal histories and causes of illness, often under the umbrella of heterogeneous clinical syndromes, such as sepsis or acute respiratory distress syndrome. Given this variation, it is reasonable to expect that the effect of treatment on outcomes may differ for individuals with variable characteristics. However, in randomized controlled trials, efficacy is typically assessed by the average treatment effect (ATE), which quantifies the average effect of the intervention on the outcome in the study population. Importantly, the ATE may hide variations of the treatment's effect on a clinical outcome across levels of patient characteristics, which may erroneously lead to the conclusion that an intervention does not work overall when it may in fact benefit certain patients. In this review, we describe methodological approaches for assessing heterogeneity of treatment effect (HTE), including expert-derived subgrouping, data-driven subgrouping, baseline risk modeling, treatment effect modeling, and individual treatment rule estimation. Next, we outline how insights from HTE analyses can be incorporated into the design of clinical trials. Finally, we propose a research agenda for advancing the field and bringing HTE approaches to the bedside.
Collapse
Affiliation(s)
- Emma J Graham Linck
- Department of Biostatistics and Medical Informatics, UW-Madison, Madison, WI
| | - Ewan C Goligher
- Interdepartmental Division of Critical Care Medicine, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Department of Physiology, University of Toronto, Toronto, ON, Canada
- Toronto General Hospital Research Institute, University Health Network, Toronto, ON, Canada
| | - Matthew W Semler
- Division of Allergy, Pulmonary, and Critical Care Medicine, Vanderbilt University Medical Center, Nashville, TN
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN
| | - Matthew M Churpek
- Department of Biostatistics and Medical Informatics, UW-Madison, Madison, WI
- Division of Pulmonary and Critical Care, Department of Medicine, University of Wisconsin-Madison, Madison, WI
| |
Collapse
|
4
|
Carrillo-Larco RM, Bravo-Rocca G, Castillo-Cara M, Xu X, Bernabe-Ortiz A. A multimodal approach using fundus images and text meta-data in a machine learning classifier with embeddings to predict years with self-reported diabetes - An exploratory analysis. Prim Care Diabetes 2024; 18:327-332. [PMID: 38616442 DOI: 10.1016/j.pcd.2024.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 01/17/2024] [Accepted: 04/09/2024] [Indexed: 04/16/2024]
Abstract
AIMS Machine learning models can use image and text data to predict the number of years since diabetes diagnosis; such model can be applied to new patients to predict, approximately, how long the new patient may have lived with diabetes unknowingly. We aimed to develop a model to predict self-reported diabetes duration. METHODS We used the Brazilian Multilabel Ophthalmological Dataset. Unit of analysis was the fundus image and its meta-data, regardless of the patient. We included people 40 + years and fundus images without diabetic retinopathy. Fundus images and meta-data (sex, age, comorbidities and taking insulin) were passed to the MedCLIP model to extract the embedding representation. The embedding representation was passed to an Extra Tree Classifier to predict: 0-4, 5-9, 10-14 and 15 + years with self-reported diabetes. RESULTS There were 988 images from 563 people (mean age = 67 years; 64 % were women). Overall, the F1 score was 57 %. The group 15 + years of self-reported diabetes had the highest precision (64 %) and F1 score (63 %), while the highest recall (69 %) was observed in the group 0-4 years. The proportion of correctly classified observations was 55 % for the group 0-4 years, 51 % for 5-9 years, 58 % for 10-14 years, and 64 % for 15 + years with self-reported diabetes. CONCLUSIONS The machine learning model had acceptable accuracy and F1 score, and correctly classified more than half of the patients according to diabetes duration. Using large foundational models to extract image and text embeddings seems a feasible and efficient approach to predict years living with self-reported diabetes.
Collapse
Affiliation(s)
- Rodrigo M Carrillo-Larco
- Hubert Department of Global Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA; Emory Global Diabetes Research Center, Emory University, Atlanta, GA, USA.
| | | | | | - Xiaolin Xu
- School of Public Health, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China; The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China; School of Public Health, Faculty of Medicine, The University of Queensland, Brisbane, Australia
| | | |
Collapse
|
5
|
Kalinin MN, Khasanova DR. Heterogeneous treatment effects of Cerebrolysin as an early add-on to reperfusion therapy: post hoc analysis of the CEREHETIS trial. Front Pharmacol 2024; 14:1288718. [PMID: 38249342 PMCID: PMC10796496 DOI: 10.3389/fphar.2023.1288718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 12/11/2023] [Indexed: 01/23/2024] Open
Abstract
Background: There has been intensive research into enhancing the effects of reperfusion therapy to mitigate hemorrhagic transformation (HT) in stroke patients. Using neuroprotective agents alongside intravenous thrombolysis (IVT) appears a promising approach. Cerebrolysin is one of the candidates since it consists of neuropeptides mimicking the action of neurotrophic factors on brain protection and repair. Objectives: We looked at treatment effects of Cerebrolysin as an early add-on to IVT in stroke patients with varying HT risk. Methods: It was post hoc analysis of the CEREHETIS trial (ISRCTN87656744). Patients with middle cerebral artery infarction (n = 238) were selected from the intention-to-treat population. To stratify participants according to their HT risk, the DRAGON, SEDAN and HTI scores were computed for each eligible subject using on-admission data. The study endpoints were any and symptomatic HT, and functional outcome measured with the modified Rankin Scale (mRS) on day 90. Favorable functional outcome (FFO) was defined as an mRS ≤2. The performance of each stratification tool was estimated with regression approaches. Heterogeneous treatment effect analysis was conducted using techniques of meta-analysis and the matching-smoothing method. Results: The HTI score outperformed other tools in terms of HT risk stratification. Heterogeneity of Cerebrolysin treatment effects was moderate (I2, 35.8%-56.7%; H2, 1.56-2.31) and mild (I2, 10.9%; H2, 1.12) for symptomatic and any HT, respectively. A significant positive impact of Cerebrolysin on HT and functional outcome was observed in the moderate (HTI = 1) and high (HTI ≥2) HT risk patients, but it was neutral in those with the low (HTI = 0) risk. In particular, there was a steady decline in the rate of symptomatic (HTI = 0 vs. HTI = 4: by 4.3%, p = 0.077 vs. 21.1%, p < 0.001) and any HT (HTI = 0 vs. HTI = 4: by 1.2%, p = 0.737 vs. 32.7%, p < 0.001). Likewise, an mRS score reduction (HTI = 0 vs. HTI = 4: by 1.8%, p = 0.903 vs. 126%, p < 0.001) with a reciprocal increase of the fraction of FFO patients (HTI = 0 vs. HTI = 4: by 1.2% p = 0.757 vs. 35.5%, p < 0.001) was found. Conclusion: Clinically meaningful heterogeneity of Cerebrolysin treatment effects on HT and functional outcome was established in stroke patients. The beneficial effects were significant in those whose estimated on-admission HT risk was either moderate or high.
Collapse
Affiliation(s)
- Mikhail N. Kalinin
- Department of Neurology, Kazan State Medical University, Kazan, Russia
- Department of Neurology, Interregional Clinical Diagnostic Center, Kazan, Russia
| | - Dina R. Khasanova
- Department of Neurology, Kazan State Medical University, Kazan, Russia
- Department of Neurology, Interregional Clinical Diagnostic Center, Kazan, Russia
| |
Collapse
|
6
|
Kalinin MN, Khasanova DR. [Cerebrolysin as an early add-on to reperfusion therapy: heterogeneous treatment effect analysis in ischemic stroke patients with varying risk of hemorrhagic transformation]. Zh Nevrol Psikhiatr Im S S Korsakova 2024; 124:55-66. [PMID: 38512096 DOI: 10.17116/jnevro202412403255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/22/2024]
Abstract
OBJECTIVE The study goal was the assessment of heterogeneous treatment effects of Cerebrolysin as an early add-on to reperfusion therapy in stroke patients with varying risk of hemorrhagic transformation (HT). MATERIAL AND METHODS It was post hoc analysis of the CEREHETIS trial (ISRCTN87656744). Patients with middle cerebral artery infarction (n=238) were stratified by HT risk with the HTI score. The study outcomes were symptomatic and any HT, and functional outcome measured with the modified Rankin Scale (mRS) on day 90. Favorable outcome was defined as an mRS score of ≤2. Heterogeneous treatment effect analysis was performed using techniques of meta-analysis and the matching-smoothing method. RESULTS Heterogeneity of Cerebrolysin treatment effects was moderate (I2=36.98-69.3%, H2=1.59-3.26) and mild (I2=18.33-32.39%, H2=1.22-1.48) for symptomatic and any HT, respectively. A positive impact of the Cerebrolysin treatment on HT and functional outcome was observed in patients with moderate (HTI=1) and high (HTI≥2) HT risk. However, the effect was neutral in those with low risk (HTI=0). In high HT risk patients, there was a steady decline in the rate of symptomatic (HTI=0 vs. HTI≥2: by 3.8%, p=0.120 vs. 14.3%, p<0.001) and any HT (HTI=0 vs. HTI≥2: by 0.6%, p=0.864 vs. 19.5%, p<0.001). Likewise, Cerebrolysin treatment resulted in an overall decrease in the mRS scores (HTI=0 vs. HTI≥2: by 2.1%, p=0.893 vs. 63%, p<0.001) with a reciprocal increase of the fraction with favorable outcome (HTI=0 vs. HTI≥2: by 2% p=0.634 vs. 19.2%, p<0.001). CONCLUSION Clinically meaningful heterogeneity of Cerebrolysin treatment effects on HT and functional outcome was established in stroke patients. The Cerebrolysin positive impact was significant in those whose estimated on-admission HT risk was either moderate or high.
Collapse
Affiliation(s)
- M N Kalinin
- Kazan State Medical University, Kazan, Russia
- Interregional Clinical Diagnostic Center, Kazan, Russia
| | - D R Khasanova
- Kazan State Medical University, Kazan, Russia
- Interregional Clinical Diagnostic Center, Kazan, Russia
| |
Collapse
|
7
|
Misra S, Wagner R, Ozkan B, Schön M, Sevilla-Gonzalez M, Prystupa K, Wang CC, Kreienkamp RJ, Cromer SJ, Rooney MR, Duan D, Thuesen ACB, Wallace AS, Leong A, Deutsch AJ, Andersen MK, Billings LK, Eckel RH, Sheu WHH, Hansen T, Stefan N, Goodarzi MO, Ray D, Selvin E, Florez JC, Meigs JB, Udler MS. Precision subclassification of type 2 diabetes: a systematic review. COMMUNICATIONS MEDICINE 2023; 3:138. [PMID: 37798471 PMCID: PMC10556101 DOI: 10.1038/s43856-023-00360-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 09/15/2023] [Indexed: 10/07/2023] Open
Abstract
BACKGROUND Heterogeneity in type 2 diabetes presentation and progression suggests that precision medicine interventions could improve clinical outcomes. We undertook a systematic review to determine whether strategies to subclassify type 2 diabetes were associated with high quality evidence, reproducible results and improved outcomes for patients. METHODS We searched PubMed and Embase for publications that used 'simple subclassification' approaches using simple categorisation of clinical characteristics, or 'complex subclassification' approaches which used machine learning or 'omics approaches in people with established type 2 diabetes. We excluded other diabetes subtypes and those predicting incident type 2 diabetes. We assessed quality, reproducibility and clinical relevance of extracted full-text articles and qualitatively synthesised a summary of subclassification approaches. RESULTS Here we show data from 51 studies that demonstrate many simple stratification approaches, but none have been replicated and many are not associated with meaningful clinical outcomes. Complex stratification was reviewed in 62 studies and produced reproducible subtypes of type 2 diabetes that are associated with outcomes. Both approaches require a higher grade of evidence but support the premise that type 2 diabetes can be subclassified into clinically meaningful subtypes. CONCLUSION Critical next steps toward clinical implementation are to test whether subtypes exist in more diverse ancestries and whether tailoring interventions to subtypes will improve outcomes.
Collapse
Affiliation(s)
- Shivani Misra
- Department of Metabolism, Digestion and Reproduction, Imperial College London, London, UK.
- Department of Diabetes and Endocrinology, Imperial College Healthcare NHS Trust, London, UK.
| | - Robert Wagner
- Department of Endocrinology and Diabetology, University Hospital Düsseldorf, Heinrich Heine University Düsseldorf, Moorenstr. 5, 40225, Düsseldorf, Germany
- Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Auf'm Hennekamp 65, 40225, Düsseldorf, Germany
- German Center for Diabetes Research (DZD), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
| | - Bige Ozkan
- Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Ciccarone Center for the Prevention of Cardiovascular Disease, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Martin Schön
- Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Auf'm Hennekamp 65, 40225, Düsseldorf, Germany
- German Center for Diabetes Research (DZD), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
- Institute of Experimental Endocrinology, Biomedical Research Center, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Magdalena Sevilla-Gonzalez
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital, Boston, MA, USA
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Katsiaryna Prystupa
- Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Auf'm Hennekamp 65, 40225, Düsseldorf, Germany
- German Center for Diabetes Research (DZD), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
| | - Caroline C Wang
- Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Raymond J Kreienkamp
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Pediatrics, Division of Endocrinology, Boston Children's Hospital, Boston, MA, USA
| | - Sara J Cromer
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Mary R Rooney
- Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Daisy Duan
- Division of Endocrinology, Diabetes and Metabolism, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Anne Cathrine Baun Thuesen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Amelia S Wallace
- Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Aaron Leong
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA
- Division of General Internal Medicine, Massachusetts General Hospital, 100 Cambridge St 16th Floor, Boston, MA, USA
| | - Aaron J Deutsch
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Mette K Andersen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Liana K Billings
- Division of Endocrinology, Diabetes and Metabolism, NorthShore University Health System, Skokie, IL, USA
- Department of Medicine, Pritzker School of Medicine, University of Chicago, Chicago, IL, USA
| | - Robert H Eckel
- Division of Endocrinology, Metabolism and Diabetes, University of Colorado School of Medicine, Aurora, CO, USA
| | - Wayne Huey-Herng Sheu
- Institute of Molecular and Genomic Medicine, National Health Research Institute, Miaoli County, Taiwan, ROC
- Division of Endocrinology and Metabolism, Taichung Veterans General Hospital, Taichung, Taiwan, ROC
- Division of Endocrinology and Metabolism, Taipei Veterans General Hospital, Taipei, Taiwan, ROC
| | - Torben Hansen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Norbert Stefan
- German Center for Diabetes Research (DZD), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
- University Hospital of Tübingen, Tübingen, Germany
- Institute of Diabetes Research and Metabolic Diseases (IDM), Helmholtz Center Munich, Neuherberg, Germany
| | - Mark O Goodarzi
- Division of Endocrinology, Diabetes and Metabolism, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Debashree Ray
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Elizabeth Selvin
- Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Jose C Florez
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - James B Meigs
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Division of General Internal Medicine, Massachusetts General Hospital, 100 Cambridge St 16th Floor, Boston, MA, USA
| | - Miriam S Udler
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| |
Collapse
|
8
|
Jawadekar N, Kezios K, Odden MC, Stingone JA, Calonico S, Rudolph K, Zeki Al Hazzouri A. Practical Guide to Honest Causal Forests for Identifying Heterogeneous Treatment Effects. Am J Epidemiol 2023; 192:1155-1165. [PMID: 36843042 DOI: 10.1093/aje/kwad043] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 12/05/2022] [Accepted: 02/20/2023] [Indexed: 02/28/2023] Open
Abstract
"Heterogeneous treatment effects" is a term which refers to conditional average treatment effects (i.e., CATEs) that vary across population subgroups. Epidemiologists are often interested in estimating such effects because they can help detect populations that may particularly benefit from or be harmed by a treatment. However, standard regression approaches for estimating heterogeneous effects are limited by preexisting hypotheses, test a single effect modifier at a time, and are subject to the multiple-comparisons problem. In this article, we aim to offer a practical guide to honest causal forests, an ensemble tree-based learning method which can discover as well as estimate heterogeneous treatment effects using a data-driven approach. We discuss the fundamentals of tree-based methods, describe how honest causal forests can identify and estimate heterogeneous effects, and demonstrate an implementation of this method using simulated data. Our implementation highlights the steps required to simulate data sets, build honest causal forests, and assess model performance across a variety of simulation scenarios. Overall, this paper is intended for epidemiologists and other population health researchers who lack an extensive background in machine learning yet are interested in utilizing an emerging method for identifying and estimating heterogeneous treatment effects.
Collapse
|
9
|
Venkatasubramaniam A, Mateen BA, Shields BM, Hattersley AT, Jones AG, Vollmer SJ, Dennis JM. Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine. BMC Med Inform Decis Mak 2023; 23:110. [PMID: 37328784 PMCID: PMC10276367 DOI: 10.1186/s12911-023-02207-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 06/01/2023] [Indexed: 06/18/2023] Open
Abstract
OBJECTIVE Precision medicine requires reliable identification of variation in patient-level outcomes with different available treatments, often termed treatment effect heterogeneity. We aimed to evaluate the comparative utility of individualized treatment selection strategies based on predicted individual-level treatment effects from a causal forest machine learning algorithm and a penalized regression model. METHODS Cohort study characterizing individual-level glucose-lowering response (6 month reduction in HbA1c) in people with type 2 diabetes initiating SGLT2-inhibitor or DPP4-inhibitor therapy. Model development set comprised 1,428 participants in the CANTATA-D and CANTATA-D2 randomised clinical trials of SGLT2-inhibitors versus DPP4-inhibitors. For external validation, calibration of observed versus predicted differences in HbA1c in patient strata defined by size of predicted HbA1c benefit was evaluated in 18,741 patients in UK primary care (Clinical Practice Research Datalink). RESULTS Heterogeneity in treatment effects was detected in clinical trial participants with both approaches (proportion predicted to have a benefit on SGLT2-inhibitor therapy over DPP4-inhibitor therapy: causal forest: 98.6%; penalized regression: 81.7%). In validation, calibration was good with penalized regression but sub-optimal with causal forest. A strata with an HbA1c benefit > 10 mmol/mol with SGLT2-inhibitors (3.7% of patients, observed benefit 11.0 mmol/mol [95%CI 8.0-14.0]) was identified using penalized regression but not causal forest, and a much larger strata with an HbA1c benefit 5-10 mmol with SGLT2-inhibitors was identified with penalized regression (regression: 20.9% of patients, observed benefit 7.8 mmol/mol (95%CI 6.7-8.9); causal forest 11.6%, observed benefit 8.7 mmol/mol (95%CI 7.4-10.1). CONCLUSIONS Consistent with recent results for outcome prediction with clinical data, when evaluating treatment effect heterogeneity researchers should not rely on causal forest or other similar machine learning algorithms alone, and must compare outputs with standard regression, which in this evaluation was superior.
Collapse
Affiliation(s)
| | - Bilal A Mateen
- The Alan Turing Institute, British Library, 96 Euston Road, London, NW1 2DB, UK
- University College London, Institute of Health Informatics, 222 Euston Rd, London, NW1 2DA, UK
| | - Beverley M Shields
- University of Exeter Medical School, Institute of Biomedical & Clinical Science, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter, EX2 5DW, UK
| | - Andrew T Hattersley
- University of Exeter Medical School, Institute of Biomedical & Clinical Science, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter, EX2 5DW, UK
| | - Angus G Jones
- University of Exeter Medical School, Institute of Biomedical & Clinical Science, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter, EX2 5DW, UK
| | | | - John M Dennis
- University of Exeter Medical School, Institute of Biomedical & Clinical Science, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter, EX2 5DW, UK.
| |
Collapse
|
10
|
Misra S, Wagner R, Ozkan B, Schön M, Sevilla-Gonzalez M, Prystupa K, Wang CC, Kreienkamp RJ, Cromer SJ, Rooney MR, Duan D, Thuesen ACB, Wallace AS, Leong A, Deutsch AJ, Andersen MK, Billings LK, Eckel RH, Sheu WHH, Hansen T, Stefan N, Goodarzi MO, Ray D, Selvin E, Florez JC, Meigs JB, Udler MS. Systematic review of precision subclassification of type 2 diabetes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.04.19.23288577. [PMID: 37131632 PMCID: PMC10153304 DOI: 10.1101/2023.04.19.23288577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Heterogeneity in type 2 diabetes presentation, progression and treatment has the potential for precision medicine interventions that can enhance care and outcomes for affected individuals. We undertook a systematic review to ascertain whether strategies to subclassify type 2 diabetes are associated with improved clinical outcomes, show reproducibility and have high quality evidence. We reviewed publications that deployed 'simple subclassification' using clinical features, biomarkers, imaging or other routinely available parameters or 'complex subclassification' approaches that used machine learning and/or genomic data. We found that simple stratification approaches, for example, stratification based on age, body mass index or lipid profiles, had been widely used, but no strategy had been replicated and many lacked association with meaningful outcomes. Complex stratification using clustering of simple clinical data with and without genetic data did show reproducible subtypes of diabetes that had been associated with outcomes such as cardiovascular disease and/or mortality. Both approaches require a higher grade of evidence but support the premise that type 2 diabetes can be subclassified into meaningful groups. More studies are needed to test these subclassifications in more diverse ancestries and prove that they are amenable to interventions.
Collapse
|
11
|
Wolf JM, Koopmeiners JS, Vock DM. A permutation procedure to detect heterogeneous treatment effects in randomized clinical trials while controlling the type I error rate. Clin Trials 2022; 19:512-521. [PMID: 35531765 PMCID: PMC9529771 DOI: 10.1177/17407745221095855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND/AIMS Secondary analyses of randomized clinical trials often seek to identify subgroups with differential treatment effects. These discoveries can help guide individual treatment decisions based on patient characteristics and identify populations for which additional treatments are needed. Traditional analyses require researchers to pre-specify potential subgroups to reduce the risk of reporting spurious results. There is a need for methods that can detect such subgroups without a priori specification while allowing researchers to control the probability of falsely detecting heterogeneous subgroups when treatment effects are uniform across the study population. METHODS We propose a permutation procedure for tuning parameter selection that allows for type I error control when testing for heterogeneous treatment effects framed within the Virtual Twins procedure for subgroup identification. We verify that the type I error rate can be controlled at the nominal rate and investigate the power for detecting heterogeneous effects when present through extensive simulation studies. We apply our method to a secondary analysis of data from a randomized trial of very low nicotine content cigarettes. RESULTS In the absence of type I error control, the observed type I error rate for Virtual Twins was between 99% and 100%. In contrast, models tuned via the proposed permutation were able to control the type I error rate and detect heterogeneous effects when present. An application of our approach to a recently completed trial of very low nicotine content cigarettes identified several variables with potentially heterogeneous treatment effects. CONCLUSIONS The proposed permutation procedure allows researchers to engage in secondary analyses of clinical trials for treatment effect heterogeneity while maintaining the type I error rate without pre-specifying subgroups.
Collapse
Affiliation(s)
- Jack M. Wolf
- Division of Biostatistics, School of Public Health,
University of Minnesota, Minneapolis, Minnesota, United States
| | - Joseph S. Koopmeiners
- Division of Biostatistics, School of Public Health,
University of Minnesota, Minneapolis, Minnesota, United States
| | - David M. Vock
- Division of Biostatistics, School of Public Health,
University of Minnesota, Minneapolis, Minnesota, United States
| |
Collapse
|
12
|
Edward JA, Josey K, Bahn G, Caplan L, Reusch JEB, Reaven P, Ghosh D, Raghavan S. Heterogeneous treatment effects of intensive glycemic control on major adverse cardiovascular events in the ACCORD and VADT trials: a machine-learning analysis. Cardiovasc Diabetol 2022; 21:58. [PMID: 35477454 PMCID: PMC9047276 DOI: 10.1186/s12933-022-01496-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 03/31/2022] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Evidence to guide type 2 diabetes treatment individualization is limited. We evaluated heterogeneous treatment effects (HTE) of intensive glycemic control in type 2 diabetes patients on major adverse cardiovascular events (MACE) in the Action to Control Cardiovascular Risk in Diabetes Study (ACCORD) and the Veterans Affairs Diabetes Trial (VADT). METHODS Causal forests machine learning analysis was performed using pooled individual data from two randomized trials (n = 12,042) to identify HTE of intensive versus standard glycemic control on MACE in patients with type 2 diabetes. We used variable prioritization from causal forests to build a summary decision tree and examined the risk difference of MACE between treatment arms in the resulting subgroups. RESULTS A summary decision tree used five variables (hemoglobin glycation index, estimated glomerular filtration rate, fasting glucose, age, and body mass index) to define eight subgroups in which risk differences of MACE ranged from - 5.1% (95% CI - 8.7, - 1.5) to 3.1% (95% CI 0.2, 6.0) (negative values represent lower MACE associated with intensive glycemic control). Intensive glycemic control was associated with lower MACE in pooled study data in subgroups with low (- 4.2% [95% CI - 8.1, - 1.0]), intermediate (- 5.1% [95% CI - 8.7, - 1.5]), and high (- 4.3% [95% CI - 7.7, - 1.0]) MACE rates with consistent directions of effect in ACCORD and VADT alone. CONCLUSIONS This data-driven analysis provides evidence supporting the diabetes treatment guideline recommendation of intensive glucose lowering in diabetes patients with low cardiovascular risk and additionally suggests potential benefits of intensive glycemic control in some individuals at higher cardiovascular risk.
Collapse
Affiliation(s)
- Justin A. Edward
- grid.430503.10000 0001 0703 675XDivision of Cardiology, University of Colorado School of Medicine, Aurora, CO USA
| | - Kevin Josey
- grid.422100.50000 0000 9751 469XDepartment of Veterans Affairs Eastern Colorado Healthcare System, Rocky Mountain, Regional VA Medical Center, Medicine Service (111), 1700 North Wheeling Street, Aurora, CO 80045 USA ,grid.414594.90000 0004 0401 9614Department of Biostatistics and Informatics, Colorado School of Public Health, Aurora, CO USA
| | - Gideon Bahn
- grid.280893.80000 0004 0419 5175Department of Veterans Affairs, Hines VA Hospital, Hines, IL USA
| | - Liron Caplan
- grid.422100.50000 0000 9751 469XDepartment of Veterans Affairs Eastern Colorado Healthcare System, Rocky Mountain, Regional VA Medical Center, Medicine Service (111), 1700 North Wheeling Street, Aurora, CO 80045 USA ,grid.430503.10000 0001 0703 675XDivision of Rheumatology, University of Colorado School of Medicine, Aurora, CO USA
| | - Jane E. B. Reusch
- grid.422100.50000 0000 9751 469XDepartment of Veterans Affairs Eastern Colorado Healthcare System, Rocky Mountain, Regional VA Medical Center, Medicine Service (111), 1700 North Wheeling Street, Aurora, CO 80045 USA ,grid.430503.10000 0001 0703 675XDivision of Endocrinology, Metabolism, and Diabetes, University of Colorado School of Medicine, Aurora, CO USA
| | - Peter Reaven
- grid.280893.80000 0004 0419 5175Department of Veterans Affairs Phoenix VA Medical Center, Phoenix, AZ USA
| | - Debashis Ghosh
- grid.414594.90000 0004 0401 9614Department of Biostatistics and Informatics, Colorado School of Public Health, Aurora, CO USA
| | - Sridharan Raghavan
- grid.422100.50000 0000 9751 469XDepartment of Veterans Affairs Eastern Colorado Healthcare System, Rocky Mountain, Regional VA Medical Center, Medicine Service (111), 1700 North Wheeling Street, Aurora, CO 80045 USA ,grid.430503.10000 0001 0703 675XDivision of Biomedical Informatics and Personalized Medicine, University of Colorado School of Medicine, Aurora, CO USA ,grid.512286.aColorado Cardiovascular Outcomes Research Consortium, Aurora, CO USA
| |
Collapse
|