1
|
Westgate PM, Nigam SR, Shoben AB. Reconsidering stepped wedge cluster randomized trial designs with implementation periods: Fewer sequences or the parallel-group design with baseline and implementation periods are potentially more efficient. Clin Trials 2024; 21:710-722. [PMID: 38650332 PMCID: PMC11493850 DOI: 10.1177/17407745241244790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2024]
Abstract
BACKGROUND/AIMS When designing a cluster randomized trial, advantages and disadvantages of tentative designs must be weighed. The stepped wedge design is popular for multiple reasons, including its potential to increase power via improved efficiency relative to a parallel-group design. In many realistic settings, it will take time for clusters to fully implement the intervention. When designing the HEALing (Helping to End Addiction Long-termSM) Communities Study, implementation time was a major consideration, and we examined the efficiency and practicality of three designs. Specifically, a three-sequence stepped wedge design with implementation periods, a corresponding two-sequence modified design that is created by removing the middle sequence, and a parallel-group design with baseline and implementation periods. In this article, we study the relative efficiencies of these specific designs. More generally, we study the relative efficiencies of modified designs when the stepped wedge design with implementation periods has three or more sequences. We also consider different correlation structures. METHODS We compare efficiencies of stepped wedge designs with implementation periods consisting of three to nine sequences with a variety of corresponding designs. The three-sequence design is compared to the two-sequence modified design and to the parallel-group design with baseline and implementation periods analysed via analysis of covariance. Stepped wedge designs with implementation periods consisting of four or more sequences are compared to modified designs that remove all or a subset of 'middle' sequences. Efficiencies are based on the use of linear mixed effects models. RESULTS In the studied settings, the modified design is more efficient than the three-sequence stepped wedge design with implementation periods. The parallel-group design with baseline and implementation periods with analysis of covariance-based analysis is often more efficient than the three-sequence design. With respect to stepped wedge designs with implementation periods that are comprised of more sequences, there are often corresponding modified designs that improve efficiency. However, use of only the first and last sequences has the potential to be either relatively efficient or inefficient. Relative efficiency is impacted by the strength of the statistical correlation among outcomes from the same cluster; for example, the relative efficiencies of modified designs tend to be greater for smaller cluster auto-correlation values. CONCLUSION If a three-sequence stepped wedge design with implementation periods is being considered for a future cluster randomized trial, then a corresponding modified design using only the first and last sequences should be considered if sole focus is on efficiency. However, a parallel-group design with baseline and implementation periods and analysis of covariance-based analysis can be a practical, efficient alternative. For stepped wedge designs with implementation periods and a larger number of sequences, modified versions that remove 'middle' sequences should be considered. Due to the potential sensitivity of design efficiencies, statistical correlation should be carefully considered.
Collapse
Affiliation(s)
- Philip M Westgate
- Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, USA
| | - Shawn R Nigam
- Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, USA
| | - Abigail B Shoben
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
2
|
Moerbeek M. Optimal design of cluster randomized crossover trials with a continuous outcome: Optimal number of time periods and treatment switches under a fixed number of clusters or fixed budget. Behav Res Methods 2024; 56:8820-8830. [PMID: 39271634 PMCID: PMC11525278 DOI: 10.3758/s13428-024-02505-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/25/2024] [Indexed: 09/15/2024]
Abstract
In the cluster randomized crossover trial, a sequence of treatment conditions, rather than just one treatment condition, is assigned to each cluster. This contribution studies the optimal number of time periods in studies with a treatment switch at the end of each time period, and the optimal number of treatment switches in a trial with a fixed number of time periods. This is done for trials with a fixed number of clusters, and for trials in which the costs per cluster, subject, and treatment switch are taken into account using a budgetary constraint. The focus is on trials with a cross-sectional design where a continuous outcome variable is measured at the end of each time period. An exponential decay correlation structure is used to model dependencies among subjects within the same cluster. A linear multilevel mixed model is used to estimate the treatment effect and its associated variance. The optimal design minimizes this variance. Matrix algebra is used to identify the optimal design and other highly efficient designs. For a fixed number of clusters, a design with the maximum number of time periods is optimal and treatment switches should occur at each time period. However, when a budgetary constraint is taken into account, the optimal design may have fewer time periods and fewer treatment switches. The Shiny app was developed to facilitate the use of the methodology in this contribution.
Collapse
Affiliation(s)
- Mirjam Moerbeek
- Department of Methodology and Statistics, Utrecht University, PO Box 80140, Utrecht, TC, 3508, The Netherlands.
| |
Collapse
|
3
|
Hooper R, Quintin O, Kasza J. Efficient designs for three-sequence stepped wedge trials with continuous recruitment. Clin Trials 2024; 21:723-733. [PMID: 38773924 PMCID: PMC11528865 DOI: 10.1177/17407745241251780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2024]
Abstract
BACKGROUND/AIMS The standard approach to designing stepped wedge trials that recruit participants in a continuous stream is to divide time into periods of equal length. But the choice of design in such cases is infinitely more flexible: each cluster could cross from the control to the intervention at any point on the continuous time-scale. We consider the case of a stepped wedge design with clusters randomised to just three sequences (designs with small numbers of sequences may be preferred for their simplicity and practicality) and investigate the choice of design that minimises the variance of the treatment effect estimator under different assumptions about the intra-cluster correlation. METHODS We make some simplifying assumptions in order to calculate the variance: in particular that we recruit the same number of participants, m , from each cluster over the course of the trial, and that participants present at regularly spaced intervals. We consider an intra-cluster correlation that decays exponentially with separation in time between the presentation of two individuals from the same cluster, from a value of ρ for two individuals who present at the same time, to a value of ρ τ for individuals presenting at the start and end of the trial recruitment interval. We restrict attention to three-sequence designs with centrosymmetry - the property that if we reverse time and swap the intervention and control conditions then the design looks the same. We obtain an expression for the variance of the treatment effect estimator adjusted for effects of time, using methods for generalised least squares estimation, and we evaluate this expression numerically for different designs, and for different parameter values. RESULTS There is a two-dimensional space of possible three-sequence, centrosymmetric stepped wedge designs with continuous recruitment. The variance of the treatment effect estimator for given ρ and τ can be plotted as a contour map over this space. The shape of this variance surface depends on τ and on the parameter m ρ / ( 1 - ρ ) , but typically indicates a broad, flat region of close-to-optimal designs. The 'standard' design with equally spaced periods and 1:1:1 allocation rarely performs well, however. CONCLUSIONS In many different settings, a relatively simple design can be found (e.g. one based on simple fractions) that offers close-to-optimal efficiency in that setting. There may also be designs that are robustly efficient over a wide range of settings. Contour maps of the kind we illustrate can help guide this choice. If efficiency is offered as one of the justifications for using a stepped wedge design, then it is worth designing with optimal efficiency in mind.
Collapse
Affiliation(s)
- Richard Hooper
- Wolfson Institute of Population Health, Queen Mary University of London, London, UK
| | - Olivier Quintin
- Wolfson Institute of Population Health, Queen Mary University of London, London, UK
| | - Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| |
Collapse
|
4
|
Ouyang Y, Taljaard M, Forbes AB, Li F. Maintaining the validity of inference from linear mixed models in stepped-wedge cluster randomized trials under misspecified random-effects structures. Stat Methods Med Res 2024; 33:1497-1516. [PMID: 38807552 PMCID: PMC11499024 DOI: 10.1177/09622802241248382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/30/2024]
Abstract
Linear mixed models are commonly used in analyzing stepped-wedge cluster randomized trials. A key consideration for analyzing a stepped-wedge cluster randomized trial is accounting for the potentially complex correlation structure, which can be achieved by specifying random-effects. The simplest random effects structure is random intercept but more complex structures such as random cluster-by-period, discrete-time decay, and more recently, the random intervention structure, have been proposed. Specifying appropriate random effects in practice can be challenging: assuming more complex correlation structures may be reasonable but they are vulnerable to computational challenges. To circumvent these challenges, robust variance estimators may be applied to linear mixed models to provide consistent estimators of standard errors of fixed effect parameters in the presence of random-effects misspecification. However, there has been no empirical investigation of robust variance estimators for stepped-wedge cluster randomized trials. In this article, we review six robust variance estimators (both standard and small-sample bias-corrected robust variance estimators) that are available for linear mixed models in R, and then describe a comprehensive simulation study to examine the performance of these robust variance estimators for stepped-wedge cluster randomized trials with a continuous outcome under different data generators. For each data generator, we investigate whether the use of a robust variance estimator with either the random intercept model or the random cluster-by-period model is sufficient to provide valid statistical inference for fixed effect parameters, when these working models are subject to random-effect misspecification. Our results indicate that the random intercept and random cluster-by-period models with robust variance estimators performed adequately. The CR3 robust variance estimator (approximate jackknife) estimator, coupled with the number of clusters minus two degrees of freedom correction, consistently gave the best coverage results, but could be slightly conservative when the number of clusters was below 16. We summarize the implications of our results for the linear mixed model analysis of stepped-wedge cluster randomized trials and offer some practical recommendations on the choice of the analytic model.
Collapse
Affiliation(s)
- Yongdong Ouyang
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
| | - Andrew B Forbes
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, CT, USA
| |
Collapse
|
5
|
Brusco NK, Ekegren CL, Morris ME, Hill KD, Lee AL, Somerville L, Lannin NA, Abdelmotaleb R, Callaway L, Whittaker SL, Taylor NF. Outcomes of the My Therapy self-management program in people admitted for rehabilitation: A stepped wedge cluster randomized clinical trial. Ann Phys Rehabil Med 2024; 67:101867. [PMID: 39173328 DOI: 10.1016/j.rehab.2024.101867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 05/07/2024] [Accepted: 05/19/2024] [Indexed: 08/24/2024]
Abstract
BACKGROUND Self-management programs can increase the time spent on prescribed therapeutic exercises and activities in rehabilitation inpatients, which has been associated with better functional outcomes and shorter hospital stays. OBJECTIVES To determine whether implementation of a self-management program ('My Therapy') improves functional independence relative to routine care in people admitted for physical rehabilitation. METHODS This stepped wedge, cluster randomized trial was conducted over 54 weeks (9 periods of 6-week duration, April 2021 - April 2022) across 9 clusters (general rehabilitation wards) within 4 hospitals (Victoria, Australia). We included all adults (≥18 years) admitted for rehabilitation to participating wards. The intervention included routine care plus 'My Therapy', comprising a sub-set of exercises and activities from supervised sessions which could be performed safely, without supervision or assistance. The primary outcomes were the proportion of participants achieving a minimal clinically important difference (MCID) in the Functional Independence Measure, (FIM™) and change in total FIM™ score from admission to discharge. RESULTS 2550 participants (62 % women) were recruited (control: n = 1458, intervention: n = 1092), with mean (SD) age 77 (13) years and 37 % orthopedic diagnosis. Under intervention conditions, participants reported a mean (SD) of 29 (21) minutes/day of self-directed therapy, compared to 4 (SD 14) minutes/day, under control conditions. There was no evidence of a difference between control and intervention conditions in the odds of achieving an MCID in FIM™ (adjusted odds ratio 0.93, 95 % CI 0.65 to 1.31), or in the change in FIM™ score (adjusted mean difference: -0.27 units, 95 % CI -2.67 to 2.13). CONCLUSIONS My Therapy was delivered safely to a large, diverse sample of participants admitted for rehabilitation, with an increase in daily rehabilitation dosage. However, given the lack of difference in functional improvement with participation in My Therapy, self-management programs may need to be supplemented with other strategies to improve function in people admitted for rehabilitation. TRIAL REGISTRATION Australian New Zealand Clinical Trials Registry (ACTRN12621000313831), https://www.anzctr.org.au/.
Collapse
Affiliation(s)
- Natasha K Brusco
- Rehabilitation, Ageing and Independent Living (RAIL) Research Centre, Monash University, 47-49 Moorooduc Hwy, Frankston, VIC 3199 Australia.
| | - Christina L Ekegren
- Rehabilitation, Ageing and Independent Living (RAIL) Research Centre, Monash University, 47-49 Moorooduc Hwy, Frankston, VIC 3199 Australia
| | - Meg E Morris
- Academic and Research Collaborative in Health (ARCH), La Trobe University, Bundoora, VIC 3086 Australia
| | - Keith D Hill
- Rehabilitation, Ageing and Independent Living (RAIL) Research Centre, Monash University, 47-49 Moorooduc Hwy, Frankston, VIC 3199 Australia
| | - Annemarie L Lee
- Department of Physiotherapy, Monash University, 47-49 Moorooduc Hwy, Frankston, VIC 3199 Australia
| | | | - Natasha A Lannin
- Department of Neuroscience, School of Translational Medicine, Monash University, Melbourne, VIC 3004 Australia; Alfred Health, Melbourne, VIC 3004 Australia
| | | | - Libby Callaway
- Rehabilitation, Ageing and Independent Living (RAIL) Research Centre, Monash University, 47-49 Moorooduc Hwy, Frankston, VIC 3199 Australia; Occupational Therapy Department, Monash University, 47-49 Moorooduc Hwy, Frankston, VIC 3199 Australia
| | - Sara L Whittaker
- Rehabilitation, Ageing and Independent Living (RAIL) Research Centre, Monash University, 47-49 Moorooduc Hwy, Frankston, VIC 3199 Australia
| | - Nicholas F Taylor
- Academic and Research Collaborative in Health (ARCH), La Trobe University, Bundoora, VIC 3086 Australia; Eastern Health, 2/5 Arnold Street, Box Hill, VIC 3128 Australia
| |
Collapse
|
6
|
Salway R, Jago R, de Vocht F, House D, Porter A, Walker R, Kipping R, Owen CG, Hudda MT, Northstone K, van Sluijs E. School-level intra-cluster correlation coefficients and autocorrelations for children's accelerometer-measured physical activity in England by age and gender. BMC Med Res Methodol 2024; 24:179. [PMID: 39123109 PMCID: PMC11313128 DOI: 10.1186/s12874-024-02290-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 07/17/2024] [Indexed: 08/12/2024] Open
Abstract
BACKGROUND Randomised, cluster-based study designs in schools are commonly used to evaluate children's physical activity interventions. Sample size estimation relies on accurate estimation of the intra-cluster correlation coefficient (ICC), but published estimates, especially using accelerometry-measured physical activity, are few and vary depending on physical activity outcome and participant age. Less commonly-used cluster-based designs, such as stepped wedge designs, also need to account for correlations over time, e.g. cluster autocorrelation (CAC) and individual autocorrelation (IAC), but no estimates are currently available. This paper estimates the school-level ICC, CAC and IAC for England children's accelerometer-measured physical activity outcomes by age group and gender, to inform the design of future school-based cluster trials. METHODS Data were pooled from seven large English datasets of accelerometer-measured physical activity data between 2002-18 (> 13,500 pupils, 540 primary and secondary schools). Linear mixed effect models estimated ICCs for weekday and whole week for minutes spent in moderate-to-vigorous physical activity (MVPA) and being sedentary for different age groups, stratified by gender. The CAC (1,252 schools) and IAC (34,923 pupils) were estimated by length of follow-up from pooled longitudinal data. RESULTS School-level ICCs for weekday MVPA were higher in primary schools (from 0.07 (95% CI: 0.05, 0.10) to 0.08 (95% CI: 0.06, 0.11)) compared to secondary (from 0.04 (95% CI: 0.03, 0.07) to (95% CI: 0.04, 0.10)). Girls' ICCs were similar for primary and secondary schools, but boys' were lower in secondary. For all ages, combined the CAC was 0.60 (95% CI: 0.44-0.72), and the IAC was 0.46 (95% CI: 0.42-0.49), irrespective of follow-up time. Estimates were higher for MVPA vs sedentary time, and for weekdays vs the whole week. CONCLUSIONS Adequately powered studies are important to evidence effective physical activity strategies. Our estimates of the ICC, CAC and IAC may be used to plan future school-based physical activity evaluations and were fairly consistent across a range of ages and settings, suggesting that results may be applied to other high income countries with similar school physical activity provision. It is important to use estimates appropriate to the study design, and that match the intended study population as closely as possible.
Collapse
Affiliation(s)
- Ruth Salway
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.
| | - Russell Jago
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
- The National Institute for Health Research, Applied Research Collaboration West (NIHR ARC West), University Hospitals Bristol and Weston NHS Foundation Trust, Bristol, UK
- NIHR Bristol Biomedical Research Centre, University Hospitals Bristol and Weston NHS Foundation Trust and University of Bristol, Bristol, UK
| | - Frank de Vocht
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
- The National Institute for Health Research, Applied Research Collaboration West (NIHR ARC West), University Hospitals Bristol and Weston NHS Foundation Trust, Bristol, UK
| | - Danielle House
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Alice Porter
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
- NIHR Bristol Biomedical Research Centre, University Hospitals Bristol and Weston NHS Foundation Trust and University of Bristol, Bristol, UK
| | - Robert Walker
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Ruth Kipping
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Christopher G Owen
- Population Health Research Institute, St George's, University of London, London, UK
| | - Mohammed T Hudda
- Department of Population Health, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Kate Northstone
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Esther van Sluijs
- MRC Epidemiology Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| |
Collapse
|
7
|
Bowden RA, Kasza J, Forbes AB. A simple and effective method for simulating nested exchangeable correlated binary data for longitudinal cluster randomised trials. BMC Med Res Methodol 2024; 24:174. [PMID: 39118054 PMCID: PMC11308151 DOI: 10.1186/s12874-024-02285-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 07/15/2024] [Indexed: 08/10/2024] Open
Abstract
BACKGROUND Simulation is an important tool for assessing the performance of statistical methods for the analysis of data and for the planning of studies. While methods are available for the simulation of correlated binary random variables, all have significant practical limitations for simulating outcomes from longitudinal cluster randomised trial designs, such as the cluster randomised crossover and the stepped wedge trial designs. For these trial designs as the number of observations in each cluster increases these methods either become computationally infeasible or their range of allowable correlations rapidly shrinks to zero. METHODS In this paper we present a simple method for simulating binary random variables with a specified vector of prevalences and correlation matrix. This method allows for the outcome prevalence to change due to treatment or over time, and for a 'nested exchangeable' correlation structure, in which observations in the same cluster are more highly correlated if they are measured in the same time period than in different time periods, and where different individuals are measured in each time period. This means that our method is also applicable to more general hierarchical clustered data contexts, such as students within classrooms within schools. The method is demonstrated by simulating 1000 datasets with parameters matching those derived from data from a cluster randomised crossover trial assessing two variants of stress ulcer prophylaxis. RESULTS Our method is orders of magnitude faster than the most well known general simulation method while also allowing a much wider range of correlations than alternative methods. An implementation of our method is available in an R package NestBin. CONCLUSIONS This simulation method is the first to allow for practical and efficient simulation of large datasets of binary outcomes with the commonly used nested exchangeable correlation structure. This will allow for much more effective testing of designs and inference methods for longitudinal cluster randomised trials with binary outcomes.
Collapse
Affiliation(s)
- Rhys A Bowden
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia.
| | - Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| | - Andrew B Forbes
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| |
Collapse
|
8
|
Liu J, Li F. Optimal designs using generalized estimating equations in cluster randomized crossover and stepped wedge trials. Stat Methods Med Res 2024; 33:1299-1330. [PMID: 38813761 DOI: 10.1177/09622802241247717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
Cluster randomized crossover and stepped wedge cluster randomized trials are two types of longitudinal cluster randomized trials that leverage both the within- and between-cluster comparisons to estimate the treatment effect and are increasingly used in healthcare delivery and implementation science research. While the variance expressions of estimated treatment effect have been previously developed from the method of generalized estimating equations for analyzing cluster randomized crossover trials and stepped wedge cluster randomized trials, little guidance has been provided for optimal designs to ensure maximum efficiency. Here, an optimal design refers to the combination of optimal cluster-period size and optimal number of clusters that provide the smallest variance of the treatment effect estimator or maximum efficiency under a fixed total budget. In this work, we develop optimal designs for multiple-period cluster randomized crossover trials and stepped wedge cluster randomized trials with continuous outcomes, including both closed-cohort and repeated cross-sectional sampling schemes. Local optimal design algorithms are proposed when the correlation parameters in the working correlation structure are known. MaxiMin optimal design algorithms are proposed when the exact values are unavailable, but investigators may specify a range of correlation values. The closed-form formulae of local optimal design and MaxiMin optimal design are derived for multiple-period cluster randomized crossover trials, where the cluster-period size and number of clusters are decimal. The decimal estimates from closed-form formulae can then be used to investigate the performances of integer estimates from local optimal design and MaxiMin optimal design algorithms. One unique contribution from this work, compared to the previous optimal design research, is that we adopt constrained optimization techniques to obtain integer estimates under the MaxiMin optimal design. To assist practical implementation, we also develop four SAS macros to find local optimal designs and MaxiMin optimal designs.
Collapse
Affiliation(s)
- Jingxia Liu
- Division of Public Health Sciences, Department of Surgery and Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA
| | - Fan Li
- Department of Biostatistics, Yale University, New Haven, CT, USA
| |
Collapse
|
9
|
Voldal EC, Kenny A, Xia F, Heagerty P, Hughes JP. Robust analysis of stepped wedge trials using composite likelihood models. Stat Med 2024; 43:3326-3352. [PMID: 38837431 PMCID: PMC11257102 DOI: 10.1002/sim.10120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 05/02/2024] [Accepted: 05/09/2024] [Indexed: 06/07/2024]
Abstract
Stepped wedge trials (SWTs) are a type of cluster randomized trial that involve repeated measures on clusters and design-induced confounding between time and treatment. Although mixed models are commonly used to analyze SWTs, they are susceptible to misspecification particularly for cluster-longitudinal designs such as SWTs. Mixed model estimation leverages both "horizontal" or within-cluster information and "vertical" or between-cluster information. To use horizontal information in a mixed model, both the mean model and correlation structure must be correctly specified or accounted for, since time is confounded with treatment and measurements are likely correlated within clusters. Alternative non-parametric methods have been proposed that use only vertical information; these are more robust because between-cluster comparisons in a SWT preserve randomization, but these non-parametric methods are not very efficient. We propose a composite likelihood method that focuses on vertical information, but has the flexibility to recover efficiency by using additional horizontal information. We compare the properties and performance of various methods, using simulations based on COVID-19 data and a demonstration of application to the LIRE trial. We found that a vertical composite likelihood model that leverages baseline data is more robust than traditional methods, and more efficient than methods that use only vertical information. We hope that these results demonstrate the potential value of model-based vertical methods for SWTs with a large number of clusters, and that these new tools are useful to researchers who are concerned about misspecification of traditional models.
Collapse
Affiliation(s)
| | - Avi Kenny
- Department of Biostatistics & Bioinformatics, Duke University, North Carolina, US
- Global Health Institute, Duke University, North Carolina, US
| | - Fan Xia
- Department of Epidemiology & Biostatistics, University of California San Francisco, California, US
| | - Patrick Heagerty
- Department of Biostatistics, University of Washington, Washington, US
| | - James P. Hughes
- Department of Biostatistics, University of Washington, Washington, US
| |
Collapse
|
10
|
Hughes JP, Lee WY, Troxel AB, Heagerty PJ. Sample Size Calculations for Stepped Wedge Designs with Treatment Effects that May Change with the Duration of Time under Intervention. PREVENTION SCIENCE : THE OFFICIAL JOURNAL OF THE SOCIETY FOR PREVENTION RESEARCH 2024; 25:348-355. [PMID: 37728810 PMCID: PMC10950842 DOI: 10.1007/s11121-023-01587-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/11/2023] [Indexed: 09/21/2023]
Abstract
The stepped wedge design is often used to evaluate interventions as they are rolled out across schools, health clinics, communities, or other clusters. Most models used in the design and analysis of stepped wedge trials assume that the intervention effect is immediate and constant over time following implementation of the intervention (the "exposure time"). This is known as the IT (immediate treatment effect) assumption. However, recent research has shown that using methods based on the IT assumption when the treatment effect varies over exposure time can give extremely misleading results. In this manuscript, we discuss the need to carefully specify an appropriate measure of the treatment effect when the IT assumption is violated and we show how a stepped wedge trial can be powered when it is anticipated that the treatment effect will vary as a function of the exposure time. Specifically, we describe how to power a trial when the exposure time indicator (ETI) model of Kenny et al. (Statistics in Medicine, 41, 4311-4339, 2022) is used and the estimand of interest is a weighted average of the time-varying treatment effects. We apply these methods to the ADDRESS-BP trial, a type 3 hybrid implementation study designed to address racial disparities in health care by evaluating a practice-based implementation strategy to reduce hypertension in African American communities.
Collapse
Affiliation(s)
- James P Hughes
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA.
| | - Wen-Yu Lee
- Department of Population Health, Division of Biostatistics, New York University, New York, NY, USA
| | - Andrea B Troxel
- Department of Population Health, Division of Biostatistics, New York University, New York, NY, USA
| | - Patrick J Heagerty
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
| |
Collapse
|
11
|
Li L, Kasza J, Recasens A, Ioannou L, Greenhill E, Merrett N, Cavallucci D, Ellis S, Madgwick H, Ko HS, Chantrill L, Loveday B, Nikfarjam M, Croagh D, Yang J, Dwyer A, Zalcberg J, Pilgrim C. SCANPatient: study protocol for a multi-centre, batched, stepped wedge, comparative effectiveness, randomised clinical trial of synoptic reporting of computerised tomography (CT) scans assessing cancers of the pancreas. Trials 2024; 25:388. [PMID: 38886755 PMCID: PMC11181632 DOI: 10.1186/s13063-024-08196-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2024] [Accepted: 05/22/2024] [Indexed: 06/20/2024] Open
Abstract
BACKGROUND Complete surgical removal of pancreatic ductal adenocarcinoma (PDAC) is central to all curative treatment approaches for this aggressive disease, yet this is only possible in patients technically amenable to resection. Hence, an accurate assessment of whether patients are suitable for surgery is of paramount importance. The SCANPatient trial aims to test whether implementing a structured synoptic radiological report results in increased institutional accuracy in defining surgical resectability of non-metastatic PDAC. METHODS SCANPatient is a batched, stepped wedge, comparative effectiveness, cluster randomised clinical trial. The trial will be conducted at 33 Australian hospitals all of which hold regular multi-disciplinary team meetings (MDMs) to discuss newly diagnosed patients with PDAC. Each site is required to manage a minimum of 20 patients per year (across all stages). Hospitals will be randomised to begin synoptic reporting within a batched, stepped wedge design. Initially all hospitals will continue to use their current reporting method; within each batch, after each 6-month period, a randomly selected group of hospitals will commence using the synoptic reports, until all hospitals are using synoptic reporting. Each hospital will provide data from patients who (i) are aged 18 or older; (ii) have suspected PDAC and have an abdominal CT scan, and (iii) are presented at a participating MDM. Non-metastatic patients will be documented as one of the following categories: (1) locally advanced and surgically unresectable; (2) borderline resectable; or (3) anatomically clearly resectable (Note: Metastatic disease is treated as a separate category). Data collection will last for 36 months in each batch, and a total of 2400 patients will be included. DISCUSSION Better classifying patients with non-metastatic PDAC as having tumours that are either clearly resectable, borderline or locally advanced and unresectable may improve patient outcomes by optimising care and treatment planning. The borderline resectable group are a small but important cohort in whom surgery with curative intent may be considered; however, inconsistencies with definitions and an understanding of resectability status means these patients are often incorrectly classified and hence overlooked for curative options. TRIAL REGISTRATION The SCANPatient trial was registered on 17th May 2023 in the Australian New Zealand Clinical Trials Registry (ANZCTR) (ACTRN12623000508673).
Collapse
Affiliation(s)
- Lin Li
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Ariadna Recasens
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
- Monash Program, Alfred Health, Melbourne, VIC, Australia
| | - Liane Ioannou
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Elysia Greenhill
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
- Monash Program, Alfred Health, Melbourne, VIC, Australia
| | - Neil Merrett
- Department of Surgery, Western Sydney University, Sydney, NSW, Australia
| | - David Cavallucci
- Department of Surgery, Royal Brisbane and Women's Hospital, Herston, QLD, Australia
| | - Samantha Ellis
- Department of Radiology, Alfred Health, Melbourne, VIC, Australia
| | - Helen Madgwick
- CRP Consumer Reference Group, Monash University, Melbourne, VIC, Australia
| | - Hyun Soo Ko
- Department of Cancer Imaging, The Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- The Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC, Australia
| | - Lorraine Chantrill
- Department of Medical Oncology, Wollongong Hospital, Wollongong, NSW, Australia
- Faculty of Science, Medicine and Health, University of Wollongong, Wollongong, NSW, Australia
| | - Benjamin Loveday
- Department of Surgery, Royal Melbourne Hospital, Parkville, VIC, Australia
| | | | - Daniel Croagh
- Department of Surgery, Monash Medical Centre, Melbourne, VIC, Australia
| | - Jessica Yang
- Department of Radiology, Concord Hospital, Concord, NSW, Australia
| | - Andrew Dwyer
- SA Node National Imaging Facility, Flinders Medical Centre, Bedford Park, SA, Australia
| | - John Zalcberg
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Charles Pilgrim
- Cancer Research Program, School of Public Health and Preventive Medicine, Monash University, Level 5, 553 St Kilda Road, Melbourne, VIC, 3004, Australia.
| |
Collapse
|
12
|
Hamaker ME, Wildiers H, Ardito V, Arsandaux J, Barthod-Malat A, Davies P, Degol L, Ferrara L, Fourrier C, Kenis C, Kret M, Lalet C, Pelissier SM, O'Hanlon S, Rostoft S, Seghers N, Saillour-Glénisson F, Staines A, Schwimmer C, Thevenet V, Wallet C, Soubeyran P. Study protocol for two stepped-wedge interventional trials evaluating the effects of holistic information technology-based patient-oriented management in older multimorbid patients with cancer: The GERONTE trials. J Geriatr Oncol 2024; 15:101761. [PMID: 38581958 DOI: 10.1016/j.jgo.2024.101761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 03/07/2024] [Accepted: 03/26/2024] [Indexed: 04/08/2024]
Abstract
INTRODUCTION Current hospital-based care pathways are generally single-disease centred. As a result, coexisting morbidities are often suboptimally evaluated and managed, a deficiency becoming increasingly apparent among older patients who exhibit heterogeneity in health status, functional abilities, frailty, and other geriatric impairments. To address this issue, our study aims to assess a newly developed patient-centred care pathway for older patients with multimorbidity and cancer. The new care pathway was based on currently available evidence and co-designed by end-users including health care professionals, patients, and informal caregivers. Within this care pathway, all healthcare professionals involved in the care of older patients with multimorbidity and cancer will form a Health Professional Consortium (HPC). The role of the HPC will be to centralise oncologic and non-oncologic treatment recommendations in accordance with the patient's priorities. Moreover, an Advanced Practice Nurse will act as case-manager by being the primary point of contact for the patient, thus improving coordination between specialists, and by organising and leading the consortium. Patient monitoring and the HPC collaboration will be facilitated by digital communication tools designed specifically for this purpose, with the added benefit of being customisable for each patient. MATERIALS AND METHODS The GERONTE study is a prospective international, multicentric study consisting of two stepped-wedge trials performed at 16 clinical sites across three European countries. Each trial will include 720 patients aged 70 years and over with a new or progressive cancer (breast, lung, colorectal, prostate) and at least one moderate or severe multimorbidity. The patients in the intervention group will receive the new care pathway whereas patients in the control group will receive usual oncologic care. DISCUSSION GERONTE will evaluate whether this kind of holistic, patient-oriented healthcare management can improve quality of life (primary outcome) and other valuable endpoints in older patients with multimorbidity and cancer. An ancillary study will assess in depth the socio-economic impact of the intervention and deliver concrete implementation guidelines for the GERONTE intervention care pathway. TRIAL REGISTRATION FRONE: NCT05720910 TWOBE: NCT05423808.
Collapse
Affiliation(s)
- Marije E Hamaker
- Department of Geriatric Medicine, Diakonessenhuis Utrecht, the Netherlands.
| | - Hans Wildiers
- Department of General Medical Oncology, University Hospitals Leuven, Leuven, Belgium
| | - Vittoria Ardito
- Department SDA Bocconi, Government, Health and Not for profit Division, CERGAS, Bocconi University, Milan, Italy
| | - Julie Arsandaux
- Nantes Université, Univ Angers, Laboratoire de psychologie des Pays de la Loire, LPPL, UR 4638, F-44000 Nantes, France; Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Aurore Barthod-Malat
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Paul Davies
- School of Nursing, Psychotherapy and Community Health, Dublin City University, Glasnevin, Dublin 9, Ireland
| | - Lien Degol
- Department of General Medical Oncology, University Hospitals Leuven, Leuven, Belgium
| | - Lucia Ferrara
- Department SDA Bocconi, Government, Health and Not for profit Division, CERGAS, Bocconi University, Milan, Italy
| | - Celia Fourrier
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Cindy Kenis
- Department of General Medical Oncology and Geriatric Medicine, University Hospitals Leuven, Leuven, Belgium; Department of Public Health and Primary Care, Academic Centre for Nursing and Midwifery, KU Leuven, Leuven, Belgium
| | - Marion Kret
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France; CHU de Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Caroline Lalet
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Simone Mathoulin Pelissier
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France; Univ Bordeaux, Inserm BordHEalth eaux Population U1219 Epicene Team, France
| | - Shane O'Hanlon
- Department of Geriatric Medicine, St Vincent's University Hospital, D04 T6F4 Dublin, Ireland; Department of Geriatric Medicine, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Siri Rostoft
- Department of Geriatric Medicine, Oslo University Hospital, 0424 Oslo, Norway; Institute of Clinical Medicine, University of Oslo, 0318 Oslo, Norway
| | - Nelleke Seghers
- Department of Geriatric Medicine, Diakonessenhuis Utrecht, the Netherlands
| | - Florence Saillour-Glénisson
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France; CHU de Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Anthony Staines
- School of Nursing, Psychotherapy and Community Health, Dublin City University, Glasnevin, Dublin 9, Ireland
| | - Christine Schwimmer
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France; CHU de Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Vincent Thevenet
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Cedric Wallet
- Univ. Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France; CHU de Bordeaux, INSERM, Institut Bergonié, CIC 1401, Euclid/F-CRIN clinical trials platform, F-33000 Bordeaux, France
| | - Pierre Soubeyran
- Department of Medical Oncology, Institut Bergonié, Inserm U1312, SIRIC BRIO, Université de Bordeaux, 33076 Bordeaux, France
| |
Collapse
|
13
|
Nevins P, Ryan M, Davis-Plourde K, Ouyang Y, Macedo JAP, Meng C, Tong G, Wang X, Ortiz-Reyes L, Caille A, Li F, Taljaard M. Adherence to key recommendations for design and analysis of stepped-wedge cluster randomized trials: A review of trials published 2016-2022. Clin Trials 2024; 21:199-210. [PMID: 37990575 PMCID: PMC11003836 DOI: 10.1177/17407745231208397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]
Abstract
BACKGROUND/AIMS The stepped-wedge cluster randomized trial (SW-CRT), in which clusters are randomized to a time at which they will transition to the intervention condition - rather than a trial arm - is a relatively new design. SW-CRTs have additional design and analytical considerations compared to conventional parallel arm trials. To inform future methodological development, including guidance for trialists and the selection of parameters for statistical simulation studies, we conducted a review of recently published SW-CRTs. Specific objectives were to describe (1) the types of designs used in practice, (2) adherence to key requirements for statistical analysis, and (3) practices around covariate adjustment. We also examined changes in adherence over time and by journal impact factor. METHODS We used electronic searches to identify primary reports of SW-CRTs published 2016-2022. Two reviewers extracted information from each trial report and its protocol, if available, and resolved disagreements through discussion. RESULTS We identified 160 eligible trials, randomizing a median (Q1-Q3) of 11 (8-18) clusters to 5 (4-7) sequences. The majority (122, 76%) were cross-sectional (almost all with continuous recruitment), 23 (14%) were closed cohorts and 15 (9%) open cohorts. Many trials had complex design features such as multiple or multivariate primary outcomes (50, 31%) or time-dependent repeated measures (27, 22%). The most common type of primary outcome was binary (51%); continuous outcomes were less common (26%). The most frequently used method of analysis was a generalized linear mixed model (112, 70%); generalized estimating equations were used less frequently (12, 8%). Among 142 trials with fewer than 40 clusters, only 9 (6%) reported using methods appropriate for a small number of clusters. Statistical analyses clearly adjusted for time effects in 119 (74%), for within-cluster correlations in 132 (83%), and for distinct between-period correlations in 13 (8%). Covariates were included in the primary analysis of the primary outcome in 82 (51%) and were most often individual-level covariates; however, clear and complete pre-specification of covariates was uncommon. Adherence to some key methodological requirements (adjusting for time effects, accounting for within-period correlation) was higher among trials published in higher versus lower impact factor journals. Substantial improvements over time were not observed although a slight improvement was observed in the proportion accounting for a distinct between-period correlation. CONCLUSIONS Future methods development should prioritize methods for SW-CRTs with binary or time-to-event outcomes, small numbers of clusters, continuous recruitment designs, multivariate outcomes, or time-dependent repeated measures. Trialists, journal editors, and peer reviewers should be aware that SW-CRTs have additional methodological requirements over parallel arm designs including the need to account for period effects as well as complex intracluster correlations.
Collapse
Affiliation(s)
- Pascale Nevins
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
| | - Mary Ryan
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Kendra Davis-Plourde
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Yale Center for Analytical Sciences, Yale School of Public Health, New Haven, CT, USA
| | - Yongdong Ouyang
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Ontario, Canada
| | | | - Can Meng
- Yale Center for Analytical Sciences, Yale School of Public Health, New Haven, CT, USA
| | - Guangyu Tong
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, USA
| | - Xueqi Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Section of Geriatrics, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| | - Luis Ortiz-Reyes
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
| | - Agnès Caille
- Université de Tours, Université de Nantes, INSERM, SPHERE U1246, Tours, France
- INSERM CIC 1415, CHRU de Tours, Tours, France
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, USA
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
14
|
Tian Z, Li F. Information content of stepped wedge designs under the working independence assumption. J Stat Plan Inference 2024; 229:106097. [PMID: 37954217 PMCID: PMC10634667 DOI: 10.1016/j.jspi.2023.106097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2023]
Abstract
The stepped wedge design is increasingly popular in pragmatic trials and implementation science research studies for evaluating system-level interventions that are perceived to be beneficial to patient populations. An important step in planning a stepped wedge design is to understand the efficiency of the treatment effect estimator and hence the power of the study. We develop several novel analytical results for designing stepped wedge cluster randomized trials analyzed through generalized estimating equations under a misspecified working independence correlation structure. We first contribute a general variance expression of the treatment effect estimator when data collection is scheduled for each cluster-period. Because resource and patient-centered considerations may intentionally call for an incomplete design with outcome data being omitted for certain cluster-periods, we further derive the information content based on the robust sandwich variance to identify data elements that may be preferentially omitted with minimum loss of precision in estimating the treatment effect. We prove that centrosymmetric pairs of cluster-periods, treatment sequences and periods have identical information content and thus contribute equally to the treatment effect estimation, as long as the true covariance structure for the cluster-period means remains centrosymmetric. Finally, we provide an example of how to obtain an incomplete stepped wedge design that admits a more efficient independence GEE estimator but requires less data collection effort. Our results elegantly extend existing ones from linear mixed models coupled with model-based variances to accommodate a misspecified independence working correlation structure through the robust sandwich variances.
Collapse
Affiliation(s)
- Zibo Tian
- Department of Biostatistics, University of Florida, Gainesville, FL, USA
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, USA
| |
Collapse
|
15
|
Li F, Chen X, Tian Z, Wang R, Heagerty PJ. Planning stepped wedge cluster randomized trials to detect treatment effect heterogeneity. Stat Med 2024; 43:890-911. [PMID: 38115805 DOI: 10.1002/sim.9990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Revised: 09/22/2023] [Accepted: 11/30/2023] [Indexed: 12/21/2023]
Abstract
Stepped wedge design is a popular research design that enables a rigorous evaluation of candidate interventions by using a staggered cluster randomization strategy. While analytical methods were developed for designing stepped wedge trials, the prior focus has been solely on testing for the average treatment effect. With a growing interest on formal evaluation of the heterogeneity of treatment effects across patient subpopulations, trial planning efforts need appropriate methods to accurately identify sample sizes or design configurations that can generate evidence for both the average treatment effect and variations in subgroup treatment effects. To fill in that important gap, this article derives novel variance formulas for confirmatory analyses of treatment effect heterogeneity, that are applicable to both cross-sectional and closed-cohort stepped wedge designs. We additionally point out that the same framework can be used for more efficient average treatment effect analyses via covariate adjustment, and allows the use of familiar power formulas for average treatment effect analyses to proceed. Our results further sheds light on optimal design allocations of clusters to maximize the weighted precision for assessing both the average and heterogeneous treatment effects. We apply the new methods to the Lumbar Imaging with Reporting of Epidemiology Trial, and carry out a simulation study to validate our new methods.
Collapse
Affiliation(s)
- Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA
- Center for Methods in Implementation and Prevention Science, Yale University School of Public Health, New Haven, Connecticut, USA
| | - Xinyuan Chen
- Department of Mathematics and Statistics, Mississippi State University, Mississippi State, Mississippi, USA
| | - Zizhong Tian
- Department of Public Health Sciences, Pennsylvania State University College of Medicine, Hershey, Pennsylvania, USA
| | - Rui Wang
- Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, Massachusetts, USA
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Patrick J Heagerty
- Department of Biostatistics, University of Washington, Seattle, Washington, USA
| |
Collapse
|
16
|
Lee KM, Cheung YB. Cluster randomized trial designs for modeling time-varying intervention effects. Stat Med 2024; 43:49-60. [PMID: 37947024 DOI: 10.1002/sim.9941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 09/29/2023] [Accepted: 10/02/2023] [Indexed: 11/12/2023]
Abstract
Stepped-wedge cluster randomized trials (SW-CRTs) are typically analyzed assuming a constant intervention effect. In practice, the intervention effect may vary as a function of exposure time, leading to biased results. The estimation of time-on-intervention (TOI) effects specifies separate discrete intervention effects for each elapsed period of exposure time since the intervention was first introduced. It has been demonstrated to produce results with minimum bias and nominal coverage probabilities in the analysis of SW-CRTs. Due to the design's staggered crossover, TOI effect variances are heteroskedastic in a SW-CRT. Accordingly, we hypothesize that alternative CRT designs will be more efficient at modeling certain TOI effects. We derive and compare the variance estimators of TOI effects between a SW-CRT, parallel CRT (P-CRT), parallel CRT with baseline (PB-CRT), and novel parallel CRT with baseline and an all-exposed period (PBAE-CRT). We also prove that the time-averaged TOI effect variance and point estimators are identical to that of the constant intervention effect in both P-CRTs and PB-CRTs. We then use data collected from a hospital disinvestment study to simulate and compare the differences in TOI effect estimates between the different CRT designs. Our results reveal that the SW-CRT has the most efficient estimator for the early TOI effect, whereas the PB-CRT typically has the most efficient estimator for the long-term and time-averaged TOI effects. Overall, the PB-CRT with TOI effects can be a more appropriate choice of CRT design for modeling intervention effects that vary by exposure time.
Collapse
Affiliation(s)
- Kenneth Menglin Lee
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Yin Bun Cheung
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
- Signature Research Programme in Health Services & Systems Research, Duke-NUS Medical School, Singapore, Singapore
- Tampere Center for Child, Adolescent and Maternal Health Research, Tampere University, Tampere, Finland
| |
Collapse
|
17
|
Williams NJ, Cardamone NC, Beidas RS, Marcus SC. Calculating power for multilevel implementation trials in mental health: Meaningful effect sizes, intraclass correlation coefficients, and proportions of variance explained by covariates. IMPLEMENTATION RESEARCH AND PRACTICE 2024; 5:26334895241279153. [PMID: 39346518 PMCID: PMC11437582 DOI: 10.1177/26334895241279153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/01/2024] Open
Abstract
Background Despite the ubiquity of multilevel sampling, design, and analysis in mental health implementation trials, few resources are available that provide reference values of design parameters (e.g., effect size, intraclass correlation coefficient [ICC], and proportion of variance explained by covariates [covariate R 2]) needed to accurately determine sample size. The aim of this study was to provide empirical reference values for these parameters by aggregating data on implementation and clinical outcomes from multilevel implementation trials, including cluster randomized trials and individually randomized repeated measures trials, in mental health. The compendium of design parameters presented here represents plausible values that implementation scientists can use to guide sample size calculations for future trials. Method We searched NIH RePORTER for all federally funded, multilevel implementation trials addressing mental health populations and settings from 2010 to 2020. For all continuous and binary implementation and clinical outcomes included in eligible trials, we generated values of effect size, ICC, and covariate R2 at each level via secondary analysis of trial data or via extraction of estimates from analyses in published research reports. Effect sizes were calculated as Cohen d; ICCs were generated via one-way random effects ANOVAs; covariate R2 estimates were calculated using the reduction in variance approach. Results Seventeen trials were eligible, reporting on 53 implementation and clinical outcomes and 81 contrasts between implementation conditions. Tables of effect size, ICC, and covariate R2 are provided to guide implementation researchers in power analyses for designing multilevel implementation trials in mental health settings, including two- and three-level cluster randomized designs and unit-randomized repeated-measures designs. Conclusions Researchers can use the empirical reference values reported in this study to develop meaningful sample size determinations for multilevel implementation trials in mental health. Discussion focuses on the application of the reference values reported in this study.
Collapse
Affiliation(s)
- Nathaniel J. Williams
- Institute for the Study of Behavioral Health and Addiction, Boise State University, Boise, ID, USA
- School of Social Work, Boise State University, Boise, ID, USA
| | | | - Rinad S. Beidas
- Department of Medical Social Sciences, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Steven C. Marcus
- School of Social Policy and Practice, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
18
|
Grantham KL, Forbes AB, Hooper R, Kasza J. The staircase cluster randomised trial design: A pragmatic alternative to the stepped wedge. Stat Methods Med Res 2024; 33:24-41. [PMID: 38031417 PMCID: PMC10863363 DOI: 10.1177/09622802231202364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2023]
Abstract
This article introduces the 'staircase' design, derived from the zigzag pattern of steps along the diagonal of a stepped wedge design schematic where clusters switch from control to intervention conditions. Unlike a complete stepped wedge design where all participating clusters must collect and provide data for the entire trial duration, clusters in a staircase design are only required to be involved and collect data for a limited number of pre- and post-switch periods. This could alleviate some of the burden on participating clusters, encouraging involvement in the trial and reducing the likelihood of attrition. Staircase designs are already being implemented, although in the absence of a dedicated methodology, approaches to sample size and power calculations have been inconsistent. We provide expressions for the variance of the treatment effect estimator when a linear mixed model for an outcome is assumed for the analysis of staircase designs in order to enable appropriate sample size and power calculations. These include explicit variance expressions for basic staircase designs with one pre- and one post-switch measurement period. We show how the variance of the treatment effect estimator is related to key design parameters and demonstrate power calculations for examples based on a real trial.
Collapse
Affiliation(s)
- Kelsey L Grantham
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| | - Andrew B Forbes
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| | - Richard Hooper
- Wolfson Institute of Population Health, Queen Mary University of London, London, UK
| | - Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| |
Collapse
|
19
|
Murray DM, Heagerty P, Troendle J, Lin FC, Moyer J, Stevens J, Lytle L, Zhang X, Ilias M, Masterson MY, Redmond N, Tonwe V, Clark D, Mensah GA. Implementation Research at NHLBI: Methodological and Design Challenges and Lessons Learned from the DECIPHeR Initiative. Ethn Dis 2023; DECIPHeR:12-17. [PMID: 38846726 PMCID: PMC11099519 DOI: 10.18865/ed.decipher.12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2024] Open
Abstract
NHLBI funded seven projects as part of the Disparities Elimination through Coordinated Interventions to Prevent and Control Heart and Lung Disease Risk (DECIPHeR) Initiative. They were expected to collaborate with community partners to (1) employ validated theoretical or conceptual implementation research frameworks, (2) include implementation research study designs, (3) include implementation measures as primary outcomes, and (4) inform our understanding of mediators and mechanisms of action of the implementation strategy. Several projects focused on late-stage implementation strategies that optimally and sustainably delivered two or more evidence-based multilevel interventions to reduce or eliminate cardiovascular and/or pulmonary health disparities and to improve population health in high-burden communities. Projects that were successful in the three-year planning phase transitioned to a 4-year execution phase. NHLBI formed a Technical Assistance Workgroup during the planning phase to help awardees refine study aims, strengthen research designs, detail analytic plans, and to use valid sample size methods. This paper highlights methodological and study design challenges encountered during this process. Important lessons learned included (1) the need for greater emphasis on implementation outcomes, (2) the need to clearly distinguish between intervention and implementation strategies in the protocol, (3) the need to address clustering due to randomization of groups or clusters, (4) the need to address the cross-classification that results when intervention agents work across multiple units of randomization in the same arm, (5) the need to accommodate time-varying intervention effects in stepped-wedge designs, and (6) the need for data-based estimates of the parameters required for sample size estimation.
Collapse
Affiliation(s)
| | - Patrick Heagerty
- Department of Biostatistics, University of Washington, Seattle, WA
| | - James Troendle
- Office of Biostatistical Research, National Heart, Lung, and Blood Institute, Bethesda, MD
| | - Feng-Chang Lin
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC
| | | | - June Stevens
- Departments of Nutrition and Epidemiology, University of North Carolina, Chapel Hill, NC
| | - Leslie Lytle
- Departments of Health Behavior and Nutrition, University of North Carolina, Chapel Hill, NC
| | - Xinzhi Zhang
- Center for Translation Research and Implementation Science, National Heart, Lung, and Blood Institute, Bethesda, MD
| | - Maliha Ilias
- Center for Translation Research and Implementation Science, National Heart, Lung, and Blood Institute, Bethesda, MD
| | - Mary Y. Masterson
- Center for Translation Research and Implementation Science, National Heart, Lung, and Blood Institute, Bethesda, MD
| | - Nicole Redmond
- Division of Cardiovascular Sciences, National Heart, Lung, and Blood Institute, Bethesda, MD
| | - Veronica Tonwe
- Center for Translation Research and Implementation Science, National Heart, Lung, and Blood Institute, Bethesda, MD
| | - Dave Clark
- Center for Translation Research and Implementation Science, National Heart, Lung, and Blood Institute, Bethesda, MD
| | - George A. Mensah
- Center for Translation Research and Implementation Science, National Heart, Lung, and Blood Institute, Bethesda, MD
| |
Collapse
|
20
|
Correction: Effects of Implementation of a Supervised Walking Program in Veterans Affairs Hospitals. Ann Intern Med 2023; 176:1575. [PMID: 37844304 DOI: 10.7326/l23-0389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/18/2023] Open
|
21
|
Kasza J, Bowden R, Ouyang Y, Taljaard M, Forbes AB. Does it decay? Obtaining decaying correlation parameter values from previously analysed cluster randomised trials. Stat Methods Med Res 2023; 32:2123-2134. [PMID: 37589088 PMCID: PMC10683336 DOI: 10.1177/09622802231194753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
A frequently applied assumption in the analysis of data from cluster randomised trials is that the outcomes from all participants within a cluster are equally correlated. That is, the intracluster correlation, which describes the degree of dependence between outcomes from participants in the same cluster, is the same for each pair of participants in a cluster. However, recent work has discussed the importance of allowing for this correlation to decay as the time between the measurement of participants in a cluster increases. Incorrect omission of such a decay can lead to under-powered studies, and confidence intervals for estimated treatment effects can be too narrow or too wide, depending on the characteristics of the design. When planning studies, researchers often rely on previously reported analyses of trials to inform their choice of intracluster correlation. However, most reported analyses of clustered data do not incorporate a correlation decay. Thus, often all that is available are estimates of intracluster correlations obtained under the potentially incorrect assumption of no decay. In this article, we show that it is possible to use intracluster correlation values obtained from models that incorrectly omit a decay to inform plausible choices of decaying correlations. Our focus is on intracluster correlation estimates for continuous outcomes obtained by fitting linear mixed models with exchangeable or block-exchangeable correlation structures. We describe how plausible values for decaying correlations may be obtained given these estimated intracluster correlations. An online app is presented that allows users to obtain plausible values of the decay, which can be used at the trial planning stage to assess the sensitivity of sample size and power calculations to decaying correlation structures.
Collapse
Affiliation(s)
- Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Rhys Bowden
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Yongdong Ouyang
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Canada
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Canada
| | - Andrew B Forbes
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| |
Collapse
|
22
|
Ouyang Y, Hemming K, Li F, Taljaard M. Estimating intra-cluster correlation coefficients for planning longitudinal cluster randomized trials: a tutorial. Int J Epidemiol 2023; 52:1634-1647. [PMID: 37196320 PMCID: PMC10555741 DOI: 10.1093/ije/dyad062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Accepted: 04/26/2023] [Indexed: 05/19/2023] Open
Abstract
It is well-known that designing a cluster randomized trial (CRT) requires an advance estimate of the intra-cluster correlation coefficient (ICC). In the case of longitudinal CRTs, where outcomes are assessed repeatedly in each cluster over time, estimates for more complex correlation structures are required. Three common types of correlation structures for longitudinal CRTs are exchangeable, nested/block exchangeable and exponential decay correlations-the latter two allow the strength of the correlation to weaken over time. Determining sample sizes under these latter two structures requires advance specification of the within-period ICC and cluster autocorrelation coefficient as well as the intra-individual autocorrelation coefficient in the case of a cohort design. How to estimate these coefficients is a common challenge for investigators. When appropriate estimates from previously published longitudinal CRTs are not available, one possibility is to re-analyse data from an available trial dataset or to access observational data to estimate these parameters in advance of a trial. In this tutorial, we demonstrate how to estimate correlation parameters under these correlation structures for continuous and binary outcomes. We first introduce the correlation structures and their underlying model assumptions under a mixed-effects regression framework. With practical advice for implementation, we then demonstrate how the correlation parameters can be estimated using examples and we provide programming code in R, SAS, and Stata. An Rshiny app is available that allows investigators to upload an existing dataset and obtain the estimated correlation parameters. We conclude by identifying some gaps in the literature.
Collapse
Affiliation(s)
- Yongdong Ouyang
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
| | - Karla Hemming
- Institute of Applied Health Research, The University of Birmingham, Birmingham, UK
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, CT, USA
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
23
|
Bray JE, Nehme Z, Finn JC, Kasza J, Clark RA, Stub D, Cadilhac DA, Buttery AK, Woods J, Kim J, Smith BJ, Smith K, Cartledge S, Beauchamp A, Dodge N, Walker T, Flemming-Judge E, Chow C, Stewart M, Cox N, van Gaal W, Nadurata V, Cameron P. A protocol for the Heart Matters stepped wedge cluster randomised trial: The effectiveness of heart attack education in regions at highest-risk. Resusc Plus 2023; 15:100431. [PMID: 37555197 PMCID: PMC10405322 DOI: 10.1016/j.resplu.2023.100431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2023] Open
Abstract
AIM To describe the Heart Matters (HM) trial which aims to evaluate the effectiveness of a community heart attack education intervention in high-risk areas in Victoria, Australia. These local government areas (LGAs) have high rates of acute coronary syndrome (ACS), out-of-hospital cardiac arrest (OHCA), cardiovascular risk factors, and low rates of emergency medical service (EMS) use for ACS. METHODS The trial follows a stepped-wedge cluster randomised design, with eight clusters (high-risk LGAs) randomly assigned to transition from control to intervention every four months. Two pairs of LGAs will transition simultaneously due to their proximity. The intervention consists of a heart attack education program delivered by trained HM Coordinators, with additional support from opportunistic media and a geo-targeted social media campaign. The primary outcome measure is the proportion of residents from the eight LGAs who present to emergency departments by EMS during an ACS event. Secondary outcomes include prehospital delay time, rates of OHCA and heart attack awareness. The primary and secondary outcomes will be analysed at the patient/participant level using mixed-effects logistic regression models. A detailed program evaluation is also being conducted. The trial was registered on August 9, 2021 (NCT04995900). RESULTS The intervention was implemented between February 2022 and March 2023, and outcome data will be collected from administrative databases, registries, and surveys. Primary trial data is expected to be locked for analysis by October 31st 2023, with a follow-up planned until March 31st 2024. CONCLUSION The results from this trial will provide high-level evidence the effectiveness of a community education intervention targeting regions at highest-risk of ACS and low EMS use.
Collapse
Affiliation(s)
- Janet E. Bray
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
- School of Nursing, Curtin University, Australia
- Alfred Health, Australia
| | - Ziad Nehme
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
- Ambulance Victoria, Australia
| | - Judith C. Finn
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
- School of Nursing, Curtin University, Australia
| | - Jessica Kasza
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
| | - Robyn A. Clark
- Caring Futures Institute, Flinders University, Australia
- Southern Adelaide Local Health Network, Australia
| | - Dion Stub
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
- Alfred Health, Australia
- Ambulance Victoria, Australia
| | - Dominique A. Cadilhac
- Stroke and Ageing Research, Department of Medicine, School of Clinical Sciences at Monash Health, Monash University, Australia
- Stroke Theme, Florey Institute of Neuroscience and Mental Health University of Melbourne, Australia
| | | | - Janelle Woods
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
- National Heart Foundation of Australia, Australia
| | - Joosup Kim
- Stroke and Ageing Research, Department of Medicine, School of Clinical Sciences at Monash Health, Monash University, Australia
- Stroke Theme, Florey Institute of Neuroscience and Mental Health University of Melbourne, Australia
| | - Ben J. Smith
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
- Prevention Research Collaboration, School of Public Health, University of Sydney, Australia
| | - Karen Smith
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
- Department of Research and Innovation, Silverchain, Australia
| | - Susie Cartledge
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
| | | | - Natasha Dodge
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
| | - Tony Walker
- Department of Paramedicine, Monash University, Australia
| | | | - Clara Chow
- Faculty of Medicine and Health, Westmead Applied Research Centre, University of Sydney, Australia
- Westmead Hospital, Australia
| | | | | | | | | | - Peter Cameron
- Monash School of Public Health and Preventive Medicine, Monash University, Australia
- Alfred Health, Australia
| |
Collapse
|
24
|
Maleyeff L, Li F, Haneuse S, Wang R. Assessing exposure-time treatment effect heterogeneity in stepped-wedge cluster randomized trials. Biometrics 2023; 79:2551-2564. [PMID: 36416302 PMCID: PMC10203056 DOI: 10.1111/biom.13803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 11/16/2022] [Indexed: 11/24/2022]
Abstract
A stepped-wedge cluster randomized trial (CRT) is a unidirectional crossover study in which timings of treatment initiation for clusters are randomized. Because the timing of treatment initiation is different for each cluster, an emerging question is whether the treatment effect depends on the exposure time, namely, the time duration since the initiation of treatment. Existing approaches for assessing exposure-time treatment effect heterogeneity either assume a parametric functional form of exposure time or model the exposure time as a categorical variable, in which case the number of parameters increases with the number of exposure-time periods, leading to a potential loss in efficiency. In this article, we propose a new model formulation for assessing treatment effect heterogeneity over exposure time. Rather than a categorical term for each level of exposure time, the proposed model includes a random effect to represent varying treatment effects by exposure time. This allows for pooling information across exposure-time periods and may result in more precise average and exposure-time-specific treatment effect estimates. In addition, we develop an accompanying permutation test for the variance component of the heterogeneous treatment effect parameters. We conduct simulation studies to compare the proposed model and permutation test to alternative methods to elucidate their finite-sample operating characteristics, and to generate practical guidance on model choices for assessing exposure-time treatment effect heterogeneity in stepped-wedge CRTs.
Collapse
Affiliation(s)
- Lara Maleyeff
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, Connecticut, USA
| | - Sebastien Haneuse
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Rui Wang
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
- Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
25
|
Li F, Kasza J, Turner EL, Rathouz PJ, Forbes AB, Preisser JS. Generalizing the information content for stepped wedge designs: A marginal modeling approach. Scand Stat Theory Appl 2023; 50:1048-1067. [PMID: 37601275 PMCID: PMC10434823 DOI: 10.1111/sjos.12615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Accepted: 09/02/2022] [Indexed: 11/30/2022]
Abstract
Stepped wedge trials are increasingly adopted because practical constraints necessitate staggered roll-out. While a complete design requires clusters to collect data in all periods, resource and patient-centered considerations may call for an incomplete stepped wedge design to minimize data collection burden. To study incomplete designs, we expand the metric of information content to discrete outcomes. We operate under a marginal model with general link and variance functions, and derive information content expressions when data elements (cells, sequences, periods) are omitted. We show that the centrosymmetric patterns of information content can hold for discrete outcomes with the variance-stabilizing link function. We perform numerical studies under the canonical link function, and find that while the patterns of information content for cells are approximately centrosymmetric for all examined underlying secular trends, the patterns of information content for sequences or periods are more sensitive to the secular trend, and may be far from centrosymmetric.
Collapse
Affiliation(s)
- Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA
- Center for Methods in Implementation and Prevention Science, Yale University, New Haven, Connecticut, USA
| | - Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Elizabeth L. Turner
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA
| | - Paul J. Rathouz
- Department of Population Health, The University of Texas at Austin, Austin, Texas, USA
| | - Andrew B. Forbes
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - John S. Preisser
- Department of Epidemiology, Gillings School of Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
26
|
Moerbeek M. Optimal allocation of clusters in stepped wedge designs with a decaying correlation structure. PLoS One 2023; 18:e0289275. [PMID: 37585398 PMCID: PMC10431648 DOI: 10.1371/journal.pone.0289275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 07/15/2023] [Indexed: 08/18/2023] Open
Abstract
The cluster randomized stepped wedge design is a multi-period uni-directional switch design in which all clusters start in the control condition and at the beginning of each new period a random sample of clusters crosses over to the intervention condition. Such designs often use uniform allocation, with an equal number of clusters at each treatment switch. However, the uniform allocation is not necessarily the most efficient. This study derives the optimal allocation of clusters to treatment sequences in the cluster randomized stepped wedge design, for both cohort and cross-sectional designs. The correlation structure is exponential decay, meaning the correlation decreases with the time lag between two measurements. The optimal allocation is shown to depend on the intraclass correlation coefficient, the number of subjects per cluster-period and the cluster and (in the case of a cohort design) individual autocorrelation coefficients. For small to medium values of these autocorrelations those sequences that have their treatment switch earlier or later in the study are allocated a larger proportion of clusters than those clusters that have their treatment switch halfway the study. When the autocorrelation coefficients increase, the clusters become more equally distributed across the treatment sequences. For the cohort design, the optimal allocation is almost equal to the uniform allocation when both autocorrelations approach the value 1. For almost all scenarios that were studied, the efficiency of the uniform allocation is 0.8 or higher. R code to derive the optimal allocation is available online.
Collapse
Affiliation(s)
- Mirjam Moerbeek
- Department of Methodology and Statistics, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
27
|
Moerbeek M. Optimal allocation to treatment sequences in individually randomized stepped-wedge designs with attrition. Clin Trials 2023; 20:242-251. [PMID: 36825509 PMCID: PMC10262341 DOI: 10.1177/17407745231154260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
BACKGROUND/AIMS The stepped-wedge design has been extensively studied in the setting of the cluster randomized trial, but less so for the individually randomized trial. This article derives the optimal allocation of individuals to treatment sequences. The focus is on designs where all individuals start in the control condition and at the beginning of each time period some of them cross over to the intervention, so that at the end of the trial all of them receive the intervention. METHODS The statistical model that takes into account the nesting of repeated measurements within subjects is presented. It is also shown how possible attrition is taken into account. The effect of the intervention is assumed to be sustained so that it does not change after the treatment switch. An exponential decay correlation structure is assumed, implying that the correlation between any two time point decreases with the time lag. Matrix algebra is used to derive the relation between the allocation of units to treatment sequences and the variance of the treatment effect estimator. The optimal allocation is the one that results in smallest variance. RESULTS Results are presented for three to six treatment sequences. It is shown that the optimal allocation highly depends on the correlation parameter ρ and attrition rate r between any two adjacent time points. The uniform allocation, where each treatment sequence has the same number of individuals, is often not the most efficient. For 0 . 1 ≤ ρ ≤ 0 . 9 and r = 0 , 0 . 05 , 0 . 2 , its efficiency relative to the optimal allocation is at least 0.8. It is furthermore shown how a constrained optimal allocation can be derived in case the optimal allocation is not feasible from a practical point of view. CONCLUSION This article provides the methodology for designing individually randomized stepped-wedge designs, taking into account the possibility of attrition. As such it helps researchers to plan their trial in an efficient way. To use the methodology, prior estimates of the degree of attrition and intraclass correlation coefficient are needed. It is advocated that researchers clearly report the estimates of these quantities to help facilitate planning future trials.
Collapse
Affiliation(s)
- Mirjam Moerbeek
- Department of Methodology and Statistics, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
28
|
Ouyang Y, Kulkarni MA, Protopopoff N, Li F, Taljaard M. Accounting for complex intracluster correlations in longitudinal cluster randomized trials: a case study in malaria vector control. BMC Med Res Methodol 2023; 23:64. [PMID: 36932347 PMCID: PMC10021932 DOI: 10.1186/s12874-023-01871-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 02/20/2023] [Indexed: 03/19/2023] Open
Abstract
BACKGROUND The effectiveness of malaria vector control interventions is often evaluated using cluster randomized trials (CRT) with outcomes assessed using repeated cross-sectional surveys. A key requirement for appropriate design and analysis of longitudinal CRTs is accounting for the intra-cluster correlation coefficient (ICC). In addition to exchangeable correlation (constant ICC over time), correlation structures proposed for longitudinal CRT are block exchangeable (allows a different within- and between-period ICC) and exponential decay (allows between-period ICC to decay exponentially). More flexible correlation structures are available in statistical software packages and, although not formally proposed for longitudinal CRTs, may offer some advantages. Our objectives were to empirically explore the impact of these correlation structures on treatment effect inferences, identify gaps in the methodological literature, and make practical recommendations. METHODS We obtained data from a parallel-arm CRT conducted in Tanzania to compare four different types of insecticide-treated bed-nets. Malaria prevalence was assessed in cross-sectional surveys of 45 households in each of 84 villages at baseline, 12-, 18- and 24-months post-randomization. We re-analyzed the data using mixed-effects logistic regression according to a prespecified analysis plan but under five different correlation structures as well as a robust variance estimator under exchangeable correlation and compared the estimated correlations and treatment effects. A proof-of-concept simulation was conducted to explore general conclusions. RESULTS The estimated correlation structures varied substantially across different models. The unstructured model was the best-fitting model based on information criteria. Although point estimates and confidence intervals for the treatment effect were similar, allowing for more flexible correlation structures led to different conclusions based on statistical significance. Use of robust variance estimators generally led to wider confidence intervals. Simulation results showed that under-specification can lead to coverage probabilities much lower than nominal levels, but over-specification is more likely to maintain nominal coverage. CONCLUSION More flexible correlation structures should not be ruled out in longitudinal CRTs. This may be particularly important in malaria trials where outcomes may fluctuate over time. In the absence of robust methods for selecting the best-fitting correlation structure, researchers should examine sensitivity of results to different assumptions about the ICC and consider robust variance estimators.
Collapse
Affiliation(s)
- Yongdong Ouyang
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, 1053 Carling Ave, Ottawa, ON, K1Y 4E9, Canada.
- School of Epidemiology and Public Health, University of Ottawa, 600 Peter Morand Crescent, Ottawa, ON, Canada.
| | - Manisha A Kulkarni
- School of Epidemiology and Public Health, University of Ottawa, 600 Peter Morand Crescent, Ottawa, ON, Canada
| | - Natacha Protopopoff
- Department of Disease Control, London School of Hygiene & Tropical Medicine, London, UK
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, CT, USA
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, 1053 Carling Ave, Ottawa, ON, K1Y 4E9, Canada
- School of Epidemiology and Public Health, University of Ottawa, 600 Peter Morand Crescent, Ottawa, ON, Canada
| |
Collapse
|
29
|
Zhang Y, Preisser JS, Li F, Turner EL, Toles M, Rathouz PJ. GEEMAEE: A SAS macro for the analysis of correlated outcomes based on GEE and finite-sample adjustments with application to cluster randomized trials. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 230:107362. [PMID: 36709555 PMCID: PMC10037297 DOI: 10.1016/j.cmpb.2023.107362] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 01/13/2023] [Accepted: 01/17/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND AND OBJECTIVES Generalized estimating equations (GEE) are used to analyze correlated outcomes in marginal regression models with population-averaged interpretations of exposure effects. Limitations of popular software for GEE include: (i) user choice is restricted to a small set of within-cluster pairwise correlation (intra-class correlation; ICC) structures; and (ii) inference on ICC parameters is usually not possible because the precision of their estimates is not quantified. This is important because ICC values inform the design of cluster randomized trials. Beyond the standard GEE implementation, use of paired estimating equations (Prentice 1988) provides: (i) flexible specification of models for pairwise correlations and (ii) standard errors for ICC estimates. However, most GEEs give biased estimates of standard errors and correlations when the number of clusters is small (roughly, ≤40). Consequently, there is a need for software to provide GEE analysis with finite-sample bias-corrections. METHODS The SAS macro GEEMAEE implements paired estimating equations to simultaneously estimate parameters in marginal mean and ICC models. It provides bias-corrected standard errors and uses matrix-adjusted estimating equations (MAEE) for bias-corrected estimation of correlations. Several built-in correlation matrix options, rarely found in software, are offered for multi-period, cluster randomized trials and similarly structured longitudinal observational data structures. Additional options include user-specified correlation structures and deletion diagnostics, namely Cooks' Distance and DBETA statistics that estimate the influence of observations, cluster-periods (when applicable) and clusters. RESULTS GEEMAEE is illustrated for a binary and a count outcome in two stepped wedge cluster randomized trials and a binary outcome in a longitudinal study of disease surveillance. Use of MAEE resulted in larger values of correlation estimates compared to uncorrected estimating equations. Use of bias-corrected variance estimators resulted in (appropriately) larger values of standard errors compared to the usual sandwich estimators. Deletion diagnostics identified the clusters and cluster-periods having the most influence. CONCLUSIONS The SAS macro GEEMAEE provides regression analysis for clustered or longitudinal responses, and is particularly useful when the number of clusters is small. Flexible specification and bias-corrected estimation of pairwise correlation parameters and standard errors are key features of the software to provide valid inference in real-world settings.
Collapse
Affiliation(s)
- Ying Zhang
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, 27514, U.S.A.
| | - John S Preisser
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, 27514, U.S.A
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, U.S.A; Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, CT, U.S.A
| | - Elizabeth L Turner
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, U.S.A
| | - Mark Toles
- School of Nursing, University of North Carolina, Chapel Hill, NC, U.S.A
| | - Paul J Rathouz
- Department of Population Health, The University of Texas at Austin, Austin, TX, U.S.A
| |
Collapse
|
30
|
Sarkodie SK, Wason JMS, Grayling MJ. A hybrid approach to comparing parallel-group and stepped-wedge cluster-randomized trials with a continuous primary outcome when there is uncertainty in the intra-cluster correlation. Clin Trials 2023; 20:59-70. [PMID: 36086822 PMCID: PMC9940131 DOI: 10.1177/17407745221123507] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
BACKGROUND/AIMS To evaluate how uncertainty in the intra-cluster correlation impacts whether a parallel-group or stepped-wedge cluster-randomized trial design is more efficient in terms of the required sample size, in the case of cross-sectional stepped-wedge cluster-randomized trials and continuous outcome data. METHODS We motivate our work by reviewing how the intra-cluster correlation and standard deviation were justified in 54 health technology assessment reports on cluster-randomized trials. To enable uncertainty at the design stage to be incorporated into the design specification, we then describe how sample size calculation can be performed for cluster- randomized trials in the 'hybrid' framework, which places priors on design parameters and controls the expected power in place of the conventional frequentist power. Comparison of the parallel-group and stepped-wedge cluster-randomized trial designs is conducted by placing Beta and truncated Normal priors on the intra-cluster correlation, and a Gamma prior on the standard deviation. RESULTS Many Health Technology Assessment reports did not adhere to the Consolidated Standards of Reporting Trials guideline of indicating the uncertainty around the assumed intra-cluster correlation, while others did not justify the assumed intra-cluster correlation or standard deviation. Even for a prior intra-cluster correlation distribution with a small mode, moderate prior densities on high intra-cluster correlation values can lead to a stepped-wedge cluster-randomized trial being more efficient because of the degree to which a stepped-wedge cluster-randomized trial is more efficient for high intra-cluster correlations. With careful specification of the priors, the designs in the hybrid framework can become more robust to, for example, an unexpectedly large value of the outcome variance. CONCLUSION When there is difficulty obtaining a reliable value for the intra-cluster correlation to assume at the design stage, the proposed methodology offers an appealing approach to sample size calculation. Often, uncertainty in the intra-cluster correlation will mean a stepped-wedge cluster-randomized trial is more efficient than a parallel-group cluster-randomized trial design.
Collapse
Affiliation(s)
- Samuel K Sarkodie
- Samuel K Sarkodie, Population Health
Sciences Institute, Newcastle University, 4th Floor Ridley Building 1, Queen
Victoria Road, Newcastle upon Tyne NE1 7RU, UK.
| | | | | |
Collapse
|
31
|
Herrin J, Finney Rutten LJ, Ruddy KJ, Kroenke K, Cheville AL. Pragmatic cluster randomized trial to evaluate effectiveness and implementation of EHR-facilitated collaborative symptom control in cancer (E2C2): addendum. Trials 2023; 24:21. [PMID: 36624460 PMCID: PMC9830868 DOI: 10.1186/s13063-022-06983-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 12/03/2022] [Indexed: 01/11/2023] Open
Abstract
We previously described the hypotheses, outcomes, design, and analysis for E2C2, a pragmatic stepped-wedge trial to assess an intervention to improve symptom control in patients with cancer. Subsequent consideration of the design and cohort led to the addition of a second primary hypothesis. This article describes and presents the rationale for this second hypothesis. This addendum also details a revised analytic approach, necessitated by inconsistencies in the original analytic plan. The design, outcomes, and other aspects of the protocol remain unchanged.
Collapse
Affiliation(s)
- Jeph Herrin
- grid.47100.320000000419368710Yale University School of Medicine, New Haven, CT USA
| | - Lila J. Finney Rutten
- grid.428370.a0000 0004 0409 2643Medical Affairs, Exact Sciences, Madison, WI USA ,grid.66875.3a0000 0004 0459 167XDepartment of Quantitative Health Sciences, Mayo Clinic, Rochester, MN USA
| | - Kathryn J. Ruddy
- grid.66875.3a0000 0004 0459 167XDivision of Medical Oncology, Mayo Clinic, Rochester, MN USA
| | - Kurt Kroenke
- grid.257413.60000 0001 2287 3919Indiana University School of Medicine, Indianapolis, IN USA ,grid.448342.d0000 0001 2287 2027Regenstrief Institute, Inc, Indianapolis, IN USA
| | - Andrea L. Cheville
- grid.66875.3a0000 0004 0459 167XRobert D. and Patricia E. Kern Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN USA ,grid.66875.3a0000 0004 0459 167XDepartment of Physical Medicine and Rehabilitation, Mayo Clinic, Rochester, MN USA
| |
Collapse
|
32
|
Ouyang Y, Li F, Preisser JS, Taljaard M. Sample size calculators for planning stepped-wedge cluster randomized trials: a review and comparison. Int J Epidemiol 2022; 51:2000-2013. [PMID: 35679584 PMCID: PMC9749719 DOI: 10.1093/ije/dyac123] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Accepted: 05/17/2022] [Indexed: 01/21/2023] Open
Abstract
Recent years have seen a surge of interest in stepped-wedge cluster randomized trials (SW-CRTs). SW-CRTs include several design variations and methodology is rapidly developing. Accordingly, a variety of power and sample size calculation software for SW-CRTs has been developed. However, each calculator may support only a selected set of design features and may not be appropriate for all scenarios. Currently, there is no resource to assist researchers in selecting the most appropriate calculator for planning their trials. In this paper, we review and classify 18 existing calculators that can be implemented in major platforms, such as R, SAS, Stata, Microsoft Excel, PASS and nQuery. After reviewing the main sample size considerations for SW-CRTs, we summarize the features supported by the available calculators, including the types of designs, outcomes, correlation structures and treatment effects; whether incomplete designs, cluster-size variation or secular trends are accommodated; and the analytical approach used. We then discuss in more detail four main calculators and identify their strengths and limitations. We illustrate how to use these four calculators to compute power for two real SW-CRTs with a continuous and binary outcome and compare the results. We show that the choice of calculator can make a substantial difference in the calculated power and explain these differences. Finally, we make recommendations for implementing sample size or power calculations using the available calculators. An R Shiny app is available for users to select the calculator that meets their requirements (https://douyang.shinyapps.io/swcrtcalculator/).
Collapse
Affiliation(s)
- Yongdong Ouyang
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, CT, USA
| | - John S Preisser
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
33
|
Moyer JC, Heagerty PJ, Murray DM. Analysis of multiple-period group randomized trials: random coefficients model or repeated measures ANOVA? Trials 2022; 23:987. [PMID: 36476294 PMCID: PMC9727985 DOI: 10.1186/s13063-022-06917-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 11/11/2022] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Multiple-period parallel group randomized trials (GRTs) analyzed with linear mixed models can represent time in mean models as continuous or categorical. If time is continuous, random effects are traditionally group- and member-level deviations from condition-specific slopes and intercepts and are referred to as random coefficients (RC) analytic models. If time is categorical, random effects are traditionally group- and member-level deviations from time-specific condition means and are referred to as repeated measures ANOVA (RM-ANOVA) analytic models. Longstanding guidance recommends the use of RC over RM-ANOVA for parallel GRTs with more than two periods because RC exhibited nominal type I error rates for both time parameterizations while RM-ANOVA exhibited inflated type I error rates when applied to data generated using the RC model. However, this recommendation was developed assuming a variance components covariance matrix for the RM-ANOVA, using only cross-sectional data, and explicitly modeling time × group variation. Left unanswered were how well RM-ANOVA with an unstructured covariance would perform on data generated according to the RC mechanism, if similar patterns would be observed in cohort data, and the impact of not modeling time × group variation if such variation was present in the data-generating model. METHODS Continuous outcomes for cohort and cross-sectional parallel GRT data were simulated according to RM-ANOVA and RC mechanisms at five total time periods. All simulations assumed time × group variation. We varied the number of groups, group size, and intra-cluster correlation. Analytic models using RC, RM-ANOVA, RM-ANOVA with unstructured covariance, and a Saturated random effects structure were applied to the data. All analytic models specified time × group random effects. The analytic models were then reapplied without specifying random effects for time × group. RESULTS Results indicated the RC and saturated analytic models maintained the nominal type I error rate in all data sets, RM-ANOVA with an unstructured covariance did not avoid type I error rate inflation when applied to cohort RC data, and analytic models omitting time-varying group random effects when such variation exists in the data were prone to substantial type I error inflation unless the residual error variance is high relative to the time × group variance. CONCLUSION The time × group RC and saturated analytic models are recommended as the default for multiple period parallel GRTs.
Collapse
Affiliation(s)
- Jonathan C. Moyer
- Office of Disease Prevention, National Institutes of Health, Bethesda, MD USA
| | | | - David M. Murray
- Office of Disease Prevention, National Institutes of Health, Bethesda, MD USA
| |
Collapse
|
34
|
Gallis JA, Wang X, Rathouz PJ, Preisser JS, Li F, Turner EL. power swgee: GEE-based power calculations in stepped wedge cluster randomized trials. THE STATA JOURNAL 2022; 22:811-841. [PMID: 36968149 PMCID: PMC10035664 DOI: 10.1177/1536867x221140953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Stepped wedge cluster randomized trials are increasingly being used to evaluate interventions in medical, public health, educational, and social science contexts. With the longitudinal and crossover nature of a SW-CRT, complex analysis techniques are often needed which makes appropriately powering SW-CRTs challenging. In this paper, we introduce a newly-developed SW-CRT power calculator, embedded within the power command in Stata. The power calculator assumes a marginal model (i.e., generalized estimating equations [GEE]) for the primary analysis of SW-CRTs, for which other currently available SW-CRT power calculators may not be suitable. The program accommodates complete cross-sectional and closed-cohort designs, and includes multilevel correlation structures appropriate for such designs. We discuss the methods and formulae underlying our SW-CRT calculator, and provide illustrative examples of the use of power swgee. We provide suggestions about the choice of parameters in power swgee, and conclude by discussing areas of future research which may improve the program.
Collapse
Affiliation(s)
- John A Gallis
- Department of Biostatistics, Duke University, Duke Global Health Institute, Durham, NC
| | - Xueqi Wang
- Department of Biostatistics, Duke University, Duke Global Health Institute, Durham, NC
| | - Paul J Rathouz
- Department of Population Health, University of Texas at Austin, Dell Medical School, Austin, TX
| | - John S Preisser
- Department of Biosttistics, University of North Carolina at Chapel Hill, Gillings School of Global Public Health, Chapel Hill, NC
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, Center for Methods in Implementation, Prevention Science, New Haven, CT
| | - Elizabeth L Turner
- Department of Biostatistics, Duke University, Duke Global Health Institute, Durham, NC
| |
Collapse
|
35
|
Kenny A, Voldal E, Xia F, Heagerty PJ, Hughes JP. Analysis of stepped wedge cluster randomized trials in the presence of a time-varying treatment effect. Stat Med 2022; 41:4311-4339. [PMID: 35774016 PMCID: PMC9481733 DOI: 10.1002/sim.9511] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Revised: 06/13/2022] [Accepted: 06/20/2022] [Indexed: 11/11/2022]
Abstract
Stepped wedge cluster randomized controlled trials are typically analyzed using models that assume the full effect of the treatment is achieved instantaneously. We provide an analytical framework for scenarios in which the treatment effect varies as a function of exposure time (time since the start of treatment) and define the "effect curve" as the magnitude of the treatment effect on the linear predictor scale as a function of exposure time. The "time-averaged treatment effect" (TATE) and "long-term treatment effect" (LTE) are summaries of this curve. We analytically derive the expectation of the estimatorδ ^ $$ \hat{\delta} $$ resulting from a model that assumes an immediate treatment effect and show that it can be expressed as a weighted sum of the time-specific treatment effects corresponding to the observed exposure times. Surprisingly, although the weights sum to one, some of the weights can be negative. This implies thatδ ^ $$ \hat{\delta} $$ may be severely misleading and can even converge to a value of the opposite sign of the true TATE or LTE. We describe several models, some of which make assumptions about the shape of the effect curve, that can be used to simultaneously estimate the entire effect curve, the TATE, and the LTE. We evaluate these models in a simulation study to examine the operating characteristics of the resulting estimators and apply them to two real datasets.
Collapse
Affiliation(s)
- Avi Kenny
- Department of Biostatistics, University of Washington, Seattle, Washington
| | - Emily Voldal
- Department of Biostatistics, University of Washington, Seattle, Washington
| | - Fan Xia
- Department of Biostatistics, University of Washington, Seattle, Washington
| | | | - James P. Hughes
- Department of Biostatistics, University of Washington, Seattle, Washington
| |
Collapse
|
36
|
Mildenberger P, König J. Influence of cluster-period cells in stepped wedge cluster randomized trials. Biom J 2022. [PMID: 36161328 DOI: 10.1002/bimj.202100383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 08/01/2022] [Accepted: 08/14/2022] [Indexed: 11/09/2022]
Abstract
Stepped wedge cluster randomized trials (SWCRT) are increasingly used for the evaluation of complex interventions in health services research. They randomly allocate treatments to clusters that switch to intervention under investigation at variable time points without returning to control condition. The resulting unbalanced allocation over time periods and the uncertainty about the underlying correlation structures at cluster-level renders designing and analyzing SWCRTs a challenge. Adjusting for time trends is recommended, appropriate parameterizations depend on the particular context. For sample size calculation, the covariance structure and covariance parameters are usually assumed to be known. These assumptions greatly affect the influence single cluster-period cells have on the effect estimate. Thus, it is important to understand how cluster-period cells contribute to the treatment effect estimate. We therefore discuss two measures of cell influence. These are functions of the design characteristics and covariance structure only and can thus be calculated at the planning stage: the coefficient matrix as discussed by Matthews and Forbes and information content (IC) as introduced by Kasza and Forbes. The main result is a new formula for IC that is more general and computationally more efficient. The formula applies to any generalized least squares estimator, especially for any type of time trend adjustment or nonblock diagonal matrices. We further show a functional relationship between IC and the coefficient matrix. We give two examples that tie in with current literature. All discussed tools and methods are implemented in the R package SteppedPower.
Collapse
Affiliation(s)
- Philipp Mildenberger
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center Johannes Gutenberg University Mainz, Mainz, Germany
| | - Jochem König
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center Johannes Gutenberg University Mainz, Mainz, Germany
| |
Collapse
|
37
|
Abstract
BACKGROUND This article identifies the most influential methods reports for group-randomized trials and related designs published through 2020. Many interventions are delivered to participants in real or virtual groups or in groups defined by a shared interventionist so that there is an expectation for positive correlation among observations taken on participants in the same group. These interventions are typically evaluated using a group- or cluster-randomized trial, an individually randomized group treatment trial, or a stepped wedge group- or cluster-randomized trial. These trials face methodological issues beyond those encountered in the more familiar individually randomized controlled trial. METHODS PubMed was searched to identify candidate methods reports; that search was supplemented by reports known to the author. Candidate reports were reviewed by the author to include only those focused on the designs of interest. Citation counts and the relative citation ratio, a new bibliometric tool developed at the National Institutes of Health, were used to identify influential reports. The relative citation ratio measures influence at the article level by comparing the citation rate of the reference article to the citation rates of the articles cited by other articles that also cite the reference article. RESULTS In total, 1043 reports were identified that were published through 2020. However, 55 were deemed to be the most influential based on their relative citation ratio or their citation count using criteria specific to each of the three designs, with 32 group-randomized trial reports, 7 individually randomized group treatment trial reports, and 16 stepped wedge group-randomized trial reports. Many of the influential reports were early publications that drew attention to the issues that distinguish these designs from the more familiar individually randomized controlled trial. Others were textbooks that covered a wide range of issues for these designs. Others were "first reports" on analytic methods appropriate for a specific type of data (e.g. binary data, ordinal data), for features commonly encountered in these studies (e.g. unequal cluster size, attrition), or for important variations in study design (e.g. repeated measures, cohort versus cross-section). Many presented methods for sample size calculations. Others described how these designs could be applied to a new area (e.g. dissemination and implementation research). Among the reports with the highest relative citation ratios were the CONSORT statements for each design. CONCLUSIONS Collectively, the influential reports address topics of great interest to investigators who might consider using one of these designs and need guidance on selecting the most appropriate design for their research question and on the best methods for design, analysis, and sample size.
Collapse
Affiliation(s)
- David M Murray
- Office of Disease Prevention, National Institutes of Health, North Bethesda, MD, USA
| |
Collapse
|
38
|
Li F, Yu H, Rathouz PJ, Turner EL, Preisser JS. Marginal modeling of cluster-period means and intraclass correlations in stepped wedge designs with binary outcomes. Biostatistics 2022; 23:772-788. [PMID: 33527999 PMCID: PMC9291643 DOI: 10.1093/biostatistics/kxaa056] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 11/30/2020] [Indexed: 01/09/2023] Open
Abstract
Stepped wedge cluster randomized trials (SW-CRTs) with binary outcomes are increasingly used in prevention and implementation studies. Marginal models represent a flexible tool for analyzing SW-CRTs with population-averaged interpretations, but the joint estimation of the mean and intraclass correlation coefficients (ICCs) can be computationally intensive due to large cluster-period sizes. Motivated by the need for marginal inference in SW-CRTs, we propose a simple and efficient estimating equations approach to analyze cluster-period means. We show that the quasi-score for the marginal mean defined from individual-level observations can be reformulated as the quasi-score for the same marginal mean defined from the cluster-period means. An additional mapping of the individual-level ICCs into correlations for the cluster-period means further provides a rigorous justification for the cluster-period approach. The proposed approach addresses a long-recognized computational burden associated with estimating equations defined based on individual-level observations, and enables fast point and interval estimation of the intervention effect and correlations. We further propose matrix-adjusted estimating equations to improve the finite-sample inference for ICCs. By providing a valid approach to estimate ICCs within the class of generalized linear models for correlated binary outcomes, this article operationalizes key recommendations from the CONSORT extension to SW-CRTs, including the reporting of ICCs.
Collapse
Affiliation(s)
- Fan Li
- To whom correspondence should be addressed.
| | - Hengshi Yu
- Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, MI 48109, USA
| | - Paul J Rathouz
- Department of Population Health, The University of Texas at Austin, Dell Medical School, 1601 Trinity St, Bldg. B, Austin, TX 78712, USA
| | - Elizabeth L Turner
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, 2424 Erwin Rd, Durham, NC 27710, USA
| | - John S Preisser
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27514, USA
| |
Collapse
|
39
|
Adisso ÉL, Taljaard M, Stacey D, Brière N, Zomahoun HTV, Durand PJ, Rivest LP, Légaré F. Does Adding Training in Shared Decision Making for Home Care Teams to Providing Decision Guides Better Engage Frail Elders and Caregivers in Housing Decisions? :A Stepped-Wedge Cluster Randomized Trial. JMIR Aging 2022; 5:e39386. [PMID: 35759791 PMCID: PMC9533197 DOI: 10.2196/39386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 06/06/2022] [Accepted: 06/27/2022] [Indexed: 11/24/2022] Open
Abstract
Background Frail older adults and caregivers need support from their home care teams in making difficult housing decisions, such as whether to remain at home, with or without assistance, or move into residential care. However, home care teams are often understaffed and busy, and shared decision-making training is costly. Nevertheless, overall awareness of shared decision-making is increasing. We hypothesized that distributing a decision aid could be sufficient for providing decision support without the addition of shared decision-making training for home care teams. Objective We evaluated the effectiveness of adding web-based training and workshops for care teams in interprofessional shared decision-making to passive dissemination of a decision guide on the proportion of frail older adults or caregivers of cognitively-impaired frail older adults reporting active roles in housing decision-making. Methods We conducted a stepped-wedge cluster randomized trial with home care teams in 9 health centers in Quebec, Canada. Participants were frail older adults or caregivers of cognitively impaired frail older adults facing housing decisions and receiving care from the home care team at one of the participating health centers. The intervention consisted of a 1.5-hour web-based tutorial for the home care teams plus a 3.5-hour interactive workshop in interprofessional shared decision-making using a decision guide that was designed to support frail older adults and caregivers in making housing decisions. The control was passive dissemination of the decision guide. The primary outcome was an active role in decision-making among frail older adults and caregivers, measured using the Control Preferences Scale. Secondary outcomes included decisional conflict and perceptions of how much care teams involved frail older adults and caregivers in decision-making. We performed an intention-to-treat analysis. Results A total of 311 frail older adults were included in the analysis, including 208 (66.9%) women, with a mean age of 81.2 (SD 7.5) years. Among 339 caregivers of cognitively-impaired frail older adults, 239 (70.5%) were female and their mean age was 66.4 (SD 11.7) years. The intervention increased the proportion of frail older adults reporting an active role in decision-making by 3.3% (95% CI –5.8% to 12.4%, P=.47) and the proportion of caregivers of cognitively-impaired frail older adults by 6.1% (95% CI -11.2% to 23.4%, P=.49). There was no significant impact on the secondary outcomes. However, the mean score for the frail older adults’ perception of how much health professionals involved them in decision-making increased by 5.4 (95% CI −0.6 to 11.4, P=.07) and the proportion of caregivers who reported decisional conflict decreased by 7.5% (95% CI −16.5% to 1.6%, P=.10). Conclusions Although it slightly reduced decisional conflict for caregivers, shared decision-making training did not equip home care teams significantly better than provision of a decision aid for involving frail older adults and their caregivers in decision-making. Trial Registration ClinicalTrials.gov NCT02592525; https://clinicaltrials.gov/show/NCT02592525
Collapse
Affiliation(s)
- Évèhouénou Lionel Adisso
- Tier 1 Canada Research Chair in Shared Decision Making and Knowledge Translation, 2525 Chemin de la Canardière bureau A-3421, Québec, CA.,VITAM - Centre de recherche en santé durable, Quebec, QC, CA.,Department of Social and Preventive Medicine, Faculty of Medicine, Université Laval, Quebec, QC, CA
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, CA.,School of Epidemiology and Public Health, University of Ottawa, Ottawa, CA
| | - Dawn Stacey
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, CA.,School of Nursing, University of Ottawa, Ottawa, CA
| | - Nathalie Brière
- Centre intégré universitaire de santé et de services sociaux (CIUSSS) de la Capitale-Nationale, Direction des services multidisciplinaires, Quebec, QC, CA
| | - Hervé Tchala Vignon Zomahoun
- Tier 1 Canada Research Chair in Shared Decision Making and Knowledge Translation, 2525 Chemin de la Canardière bureau A-3421, Québec, CA.,VITAM - Centre de recherche en santé durable, Quebec, QC, CA.,Department of Social and Preventive Medicine, Faculty of Medicine, Université Laval, Quebec, QC, CA.,Health and Social Services Systems, Knowledge Translation and Implementation component of the Quebec SPOR-SUPPORT Unit, Quebec, QC, CA.,Faculty of Medicine, School of Physical and Occupational Therapy, McGill University, Montreal, QC, CA
| | - Pierre Jacob Durand
- Department of Social and Preventive Medicine, Faculty of Medicine, Université Laval, Quebec, QC, CA
| | - Louis-Paul Rivest
- Department of Mathematics and Statistics, Université Laval, Quebec, QC, CA.,Canada Research Chair in Statistical Sampling and Data Analysis, Laval University, Quebec, QC, CA
| | - France Légaré
- Tier 1 Canada Research Chair in Shared Decision Making and Knowledge Translation, 2525 Chemin de la Canardière bureau A-3421, Québec, CA.,VITAM - Centre de recherche en santé durable, 2525 Chemin de la Canardière bureau A-3421, Québec, CA.,Health and Social Services Systems, Knowledge Translation and Implementation component of the Quebec SPOR-SUPPORT Unit, 2525 Chemin de la Canardière bureau A-3421, Québec, CA.,Department of Family Medicine and Emergency Medicine, Faculty of Medicine, Université Laval, Quebec, QC, Canada, 2325 Rue de l'Université, Québec, QC G1V 0A6, QUEBEC, CA
| |
Collapse
|
40
|
Rezaei-Darzi E, Kasza J, Forbes A, Bowden R. Use of information criteria for selecting a correlation structure for longitudinal cluster randomised trials. Clin Trials 2022; 19:316-325. [PMID: 35706343 DOI: 10.1177/17407745221082227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
BACKGROUND When designing and analysing longitudinal cluster randomised trials, such as the stepped wedge, the similarity of outcomes from the same cluster must be accounted for through the choice of a form for the within-cluster correlation structure. Several choices for this structure are commonly considered for application within the linear mixed model paradigm. The first assumes a constant intra-cluster correlation for all pairs of outcomes from the same cluster (the exchangeable/Hussey and Hughes model); the second assumes that correlations of outcomes measured in the same period are higher than outcomes measured in different periods (the block exchangeable model) and the third is the discrete-time decay model, which allows the correlation between pairs of outcomes to decay over time. Currently, there is limited guidance on how to select the most appropriate within-cluster correlation structure. METHODS We simulated continuous outcomes under each of the three considered within-cluster correlation structures for a range of design and parameter choices, and, using the ASReml-R package, fit each linear mixed model to each simulated dataset. We evaluated the performance of the Akaike and Bayesian information criteria for selecting the correct within-cluster correlation structure for each dataset. RESULTS For smaller total sample sizes, neither criteria performs particularly well in selecting the correct within-cluster correlation structure, with the simpler exchangeable model being favoured. Furthermore, in general, the Bayesian information criterion favours the exchangeable model. When the cluster auto-correlation (which defines the degree of dependence between observations in adjacent time periods) is large and number of periods is small, neither criteria is able to distinguish between the block exchangeable and discrete time decay models. However, for increasing numbers of clusters, periods, and subjects per cluster period, both the Akaike and Bayesian information criteria perform increasingly well in the detection of the correct within-cluster correlation structure. CONCLUSIONS With increasing amounts of data, be they number of clusters, periods or subjects per cluster period, both the Akaike and Bayesian information criteria are increasingly likely to select the correct correlation structure. We recommend that if there are sufficient data available when planning a trial, that the Akaike or Bayesian information criterion is used to guide the choice of within-cluster correlation structure in the absence of other compelling justifications for a specific correlation structure. We also suggest that researchers conduct supplementary analyses under alternate correlation structures to gauge sensitivity to the initial choice.
Collapse
Affiliation(s)
- Ehsan Rezaei-Darzi
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia.,Monash University Accident Research Centre, Monash University, Clayton, VIC, Australia
| | - Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Andrew Forbes
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Rhys Bowden
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| |
Collapse
|
41
|
Kasza J, Bowden R, Hooper R, Forbes AB. The batched stepped wedge design: A design robust to delays in cluster recruitment. Stat Med 2022; 41:3627-3641. [PMID: 35596691 PMCID: PMC9541502 DOI: 10.1002/sim.9438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 04/13/2022] [Accepted: 05/05/2022] [Indexed: 11/08/2022]
Abstract
Stepped wedge designs are an increasingly popular variant of longitudinal cluster randomized trial designs, and roll out interventions across clusters in a randomized, but step-wise fashion. In the standard stepped wedge design, assumptions regarding the effect of time on outcomes may require that all clusters start and end trial participation at the same time. This would require ethics approvals and data collection procedures to be in place in all clusters before a stepped wedge trial can start in any cluster. Hence, although stepped wedge designs are useful for testing the impacts of many cluster-based interventions on outcomes, there can be lengthy delays before a trial can commence. In this article, we introduce "batched" stepped wedge designs. Batched stepped wedge designs allow clusters to commence the study in batches, instead of all at once, allowing for staggered cluster recruitment. Like the stepped wedge, the batched stepped wedge rolls out the intervention to all clusters in a randomized and step-wise fashion: a series of self-contained stepped wedge designs. Provided that separate period effects are included for each batch, software for standard stepped wedge sample size calculations can be used. With this time parameterization, in many situations including when linear models are assumed, sample size calculations reduce to the setting of a single stepped wedge design with multiple clusters per sequence. In these situations, sample size calculations will not depend on the delays between the commencement of batches. Hence, the power of batched stepped wedge designs is robust to unexpected delays between batches.
Collapse
Affiliation(s)
- Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Rhys Bowden
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Richard Hooper
- Centre for Primary Care and Public Health, Queen Mary University of London, London, UK
| | - Andrew B Forbes
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| |
Collapse
|
42
|
Li F, Wang R. Stepped Wedge Cluster Randomized Trials: A Methodological Overview. World Neurosurg 2022; 161:323-330. [PMID: 35505551 PMCID: PMC9074087 DOI: 10.1016/j.wneu.2021.10.136] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 10/17/2021] [Accepted: 10/18/2021] [Indexed: 11/21/2022]
Abstract
BACKGROUND Stepped wedge cluster randomized trials enable rigorous evaluations of health intervention programs in pragmatic settings. In the present study, we aimed to update neurosurgeon scientists on the design of stepped wedge randomized trials. METHODS We have presented an overview of recent methodological developments for stepped wedge designs and included an update on the newer associated methodological tools to aid with future study designs. RESULTS We defined the stepped wedge trial design and reviewed the indications for the design in depth. In addition, key considerations, including mainstream methods of analysis and sample size determination, were discussed. CONCLUSIONS Stepped wedge designs can be attractive for study intervention programs aiming to improve the delivery of patient care, especially when examining a small number of heterogeneous clusters.
Collapse
Affiliation(s)
- Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA; Center for Methods in Implementation and Prevention Science, Yale University, New Haven, Connecticut, USA
| | - Rui Wang
- Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, Massachusetts, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA.
| |
Collapse
|
43
|
Grantham KL, Kasza J, Heritier S, Carlin JB, Forbes AB. Evaluating the performance of Bayesian and restricted maximum likelihood estimation for stepped wedge cluster randomized trials with a small number of clusters. BMC Med Res Methodol 2022; 22:112. [PMID: 35418034 PMCID: PMC9009029 DOI: 10.1186/s12874-022-01550-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 02/02/2022] [Indexed: 11/25/2022] Open
Abstract
Background Stepped wedge trials are an appealing and potentially powerful cluster randomized trial design. However, they are frequently implemented with a small number of clusters. Standard analysis methods for these trials such as a linear mixed model with estimation via maximum likelihood or restricted maximum likelihood (REML) rely on asymptotic properties and have been shown to yield inflated type I error when applied to studies with a small number of clusters. Small-sample methods such as the Kenward-Roger approximation in combination with REML can potentially improve estimation of the fixed effects such as the treatment effect. A Bayesian approach may also be promising for such multilevel models but has not yet seen much application in cluster randomized trials. Methods We conducted a simulation study comparing the performance of REML with and without a Kenward-Roger approximation to a Bayesian approach using weakly informative prior distributions on the intracluster correlation parameters. We considered a continuous outcome and a range of stepped wedge trial configurations with between 4 and 40 clusters. To assess method performance we calculated bias and mean squared error for the treatment effect and correlation parameters and the coverage of 95% confidence/credible intervals and relative percent error in model-based standard error for the treatment effect. Results Both REML with a Kenward-Roger standard error and degrees of freedom correction and the Bayesian method performed similarly well for the estimation of the treatment effect, while intracluster correlation parameter estimates obtained via the Bayesian method were less variable than REML estimates with different relative levels of bias. Conclusions The use of REML with a Kenward-Roger approximation may be sufficient for the analysis of stepped wedge cluster randomized trials with a small number of clusters. However, a Bayesian approach with weakly informative prior distributions on the intracluster correlation parameters offers a viable alternative, particularly when there is interest in the probability-based inferences permitted within this paradigm. Supplementary Information The online version contains supplementary material available at (10.1186/s12874-022-01550-8).
Collapse
Affiliation(s)
- Kelsey L Grantham
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| | - Jessica Kasza
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| | - Stephane Heritier
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| | - John B Carlin
- Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Parkville, Australia.,Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, University of Melbourne, Carlton, Australia
| | - Andrew B Forbes
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Australia.
| |
Collapse
|
44
|
Lee KM, Ma X, Yang GM, Cheung YB. Inclusion of unexposed clusters improves the precision of fixed effects analysis of stepped‐wedge cluster randomized trials. Stat Med 2022; 41:2923-2938. [DOI: 10.1002/sim.9394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 03/07/2022] [Accepted: 03/11/2022] [Indexed: 11/12/2022]
Affiliation(s)
| | - Xiangmei Ma
- Centre for Quantitative Medicine Duke‐NUS Medical School Singapore
| | - Grace Meijuan Yang
- Division of Supportive and Palliative Care National Cancer Centre Singapore Singapore
- Lien Centre for Palliative Care Duke‐NUS Medical School Singapore
| | - Yin Bun Cheung
- Centre for Quantitative Medicine Duke‐NUS Medical School Singapore
- Signature Programme in Health Services & Systems Research Duke‐NUS Medical School Singapore
- Tampere Center for Child, Adolescent and Maternal Health Research Tampere University Tampere Finland
| |
Collapse
|
45
|
Mariathas HH, Hurley O, Anaraki NR, Young C, Patey C, Norman P, Aubrey-Bassler K, Wang PP, Gadag V, Nguyen HV, Etchegary H, McCrate F, Knight JC, Asghari S. A Quality Improvement Emergency Department Surge Management Platform (SurgeCon): Protocol for a Stepped Wedge Cluster Randomized Trial. JMIR Res Protoc 2022; 11:e30454. [PMID: 35323121 PMCID: PMC8990381 DOI: 10.2196/30454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 12/16/2021] [Accepted: 12/18/2021] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Despite many efforts, long wait times and overcrowding in emergency departments (EDs) have remained a significant health service issue in Canada. For several years, Canada has had one of the longest wait times among the Organisation for Economic Co-operation and Development countries. From a patient's perspective, this challenge has been described as "patients wait in pain or discomfort for hours before being seen at EDs." To overcome the challenge of increased wait times, we developed an innovative ED management platform called SurgeCon that was designed based on continuous quality improvement principles to maintain patient flow and mitigate the impact of patient surge on ED efficiency. The SurgeCon quality improvement intervention includes a protocol-driven software platform, restructures ED organization and workflow, and aims to establish a more patient-centric environment. We piloted SurgeCon at an ED in Carbonear, Newfoundland and Labrador, and found that there was a 32% reduction in ED wait times. OBJECTIVE The primary objective of this trial is to determine the effects of SurgeCon on ED performance by assessing its impact on length of stay, the time to a physician's initial assessment, and the number of patients leaving the ED without being seen by a physician. The secondary objectives of this study are to evaluate SurgeCon's effects on patient satisfaction and patient-reported experiences with ED wait times and its ability to create better-value care by reducing the per-patient cost of delivering ED services. METHODS The implementation of the intervention will be assessed using a comparative effectiveness-implementation hybrid design. This type of hybrid design is known to shorten the amount of time associated with transitioning interventions from being the focus of research to being used for practice and health care services. All EDs with 24/7 on-site physician support (category A hospitals) will be enrolled in a 31-month, pragmatic, stepped wedge cluster randomized trial. All clusters (hospitals) will start with a baseline period of usual care and will be randomized to determine the order and timing of transitioning to intervention care until all hospitals are using the intervention to manage and operationalize their EDs. RESULTS Data collection for this study is continuing. As of February 2022, a total of 570 randomly selected patients have participated in telephone interviews concerning patient-reported experiences and patient satisfaction with ED wait times. The first of the 4 EDs was randomly selected, and it is currently using SurgeCon's eHealth platform and applying efficiency principles that have been learned through training since September 2021. The second randomly selected site will begin intervention implementation in winter 2022. CONCLUSIONS By assessing the impact of SurgeCon on ED services, we hope to be able to improve wait times and create better-value ED care in this health care context. TRIAL REGISTRATION ClinicalTrials.gov NCT04789902; https://clinicaltrials.gov/ct2/show/NCT04789902. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) DERR1-10.2196/30454.
Collapse
Affiliation(s)
- Hensley H Mariathas
- Centre for Rural Health Studies, Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Oliver Hurley
- Centre for Rural Health Studies, Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Nahid Rahimipour Anaraki
- Centre for Rural Health Studies, Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Christina Young
- Centre for Rural Health Studies, Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Christopher Patey
- Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada.,Eastern Health, Carbonear Institute for Rural Reach and Innovation by the Sea, Carbonear General Hospital, Carbonear, NL, Canada
| | - Paul Norman
- Eastern Health, Carbonear Institute for Rural Reach and Innovation by the Sea, Carbonear General Hospital, Carbonear, NL, Canada
| | - Kris Aubrey-Bassler
- Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Peizhong Peter Wang
- Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada.,Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Veeresh Gadag
- Division of Community Health and Humanities, Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Hai V Nguyen
- School of Pharmacy, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Holly Etchegary
- Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Farah McCrate
- Department of Research and Innovation, Eastern Health, St. John's, NL, Canada
| | - John C Knight
- Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada.,Newfoundland and Labrador Centre for Health Information, St. John's, NL, Canada
| | - Shabnam Asghari
- Centre for Rural Health Studies, Faculty of Medicine, Memorial University of Newfoundland, St. John's, NL, Canada
| |
Collapse
|
46
|
Harrison LJ, Wang R. Power calculation for analyses of cross-sectional stepped-wedge cluster randomized trials with binary outcomes via generalized estimating equations. Stat Med 2021; 40:6674-6688. [PMID: 34558112 DOI: 10.1002/sim.9205] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 08/31/2021] [Accepted: 09/06/2021] [Indexed: 11/08/2022]
Abstract
Power calculation for stepped-wedge cluster randomized trials (SW-CRTs) presents unique challenges, beyond those of standard parallel cluster randomized trials, due to the need to consider temporal within cluster correlations and background period effects. To date, power calculation methods specific to SW-CRTs have primarily been developed under a linear model. When the outcome is binary, the use of a linear model corresponds to assessing a prevalence difference; yet trial analysis often employs a nonlinear link function. We propose power calculation methods for cross-sectional SW-CRTs under a logistic model fitted by generalized estimating equations. Firstly, under an exchangeable correlation structure, we show the power based on a logistic model is lower than that from assuming a linear model in the absence of period effects. We then evaluate the impact of background prevalence changes over time on power. To allow the correlation among outcomes in the same cluster to change over time and with treatment status, we generalize the methods to more complex correlation structures. Our simulation studies demonstrate that the proposed power calculation methods perform well with the model-based variance under the true correlation structure and reveal that a working independence structure can result in substantial efficiency loss, while a working exchangeable structure performs well even when the underlying correlation structure deviates from exchangeable. An extension to our methods accounts for variable cluster sizes and reveals that unequal cluster sizes have a modest impact on power. We illustrate the approaches by application to a quality of care improvement trial for acute coronary syndrome.
Collapse
Affiliation(s)
- Linda J Harrison
- Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, Massachusetts, USA
| | - Rui Wang
- Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, Massachusetts, USA.,Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, Massachusetts, USA
| |
Collapse
|
47
|
Tian Z, Preisser JS, Esserman D, Turner EL, Rathouz PJ, Li F. Impact of unequal cluster sizes for GEE analyses of stepped wedge cluster randomized trials with binary outcomes. Biom J 2021; 64:419-439. [PMID: 34596912 PMCID: PMC9292617 DOI: 10.1002/bimj.202100112] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 07/15/2021] [Accepted: 08/07/2021] [Indexed: 12/31/2022]
Abstract
The stepped wedge (SW) design is a type of unidirectional crossover design where cluster units switch from control to intervention condition at different prespecified time points. While a convention in study planning is to assume the cluster‐period sizes are identical, SW cluster randomized trials (SW‐CRTs) involving repeated cross‐sectional designs frequently have unequal cluster‐period sizes, which can impact the efficiency of the treatment effect estimator. In this paper, we provide a comprehensive investigation of the efficiency impact of unequal cluster sizes for generalized estimating equation analyses of SW‐CRTs, with a focus on binary outcomes as in the Washington State Expedited Partner Therapy trial. Several major distinctions between our work and existing work include the following: (i) we consider multilevel correlation structures in marginal models with binary outcomes; (ii) we study the implications of both the between‐cluster and within‐cluster imbalances in sizes; and (iii) we provide a comparison between the independence working correlation versus the true working correlation and detail the consequences of ignoring correlation estimation in SW‐CRTs with unequal cluster sizes. We conclude that the working independence assumption can lead to substantial efficiency loss and a large sample size regardless of cluster‐period size variability in SW‐CRTs, and recommend accounting for correlations in the analysis. To improve study planning, we additionally provide a computationally efficient search algorithm to estimate the sample size in SW‐CRTs accounting for unequal cluster‐period sizes, and conclude by illustrating the proposed approach in the context of the Washington State study.
Collapse
Affiliation(s)
- Zibo Tian
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA
| | - John S Preisser
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Denise Esserman
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA.,Yale Center for Analytical Sciences, New Haven, CT, USA
| | - Elizabeth L Turner
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, USA.,Duke Global Health Institute, Durham, NC, USA
| | - Paul J Rathouz
- Department of Population Health, The University of Texas at Austin, Austin, TX, USA
| | - Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA.,Yale Center for Analytical Sciences, New Haven, CT, USA.,Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, USA
| |
Collapse
|
48
|
Rabideau DJ, Wang R. Randomization-based inference for a marginal treatment effect in stepped wedge cluster randomized trials. Stat Med 2021; 40:4442-4456. [PMID: 34018624 PMCID: PMC9014477 DOI: 10.1002/sim.9040] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Revised: 05/05/2021] [Accepted: 05/05/2021] [Indexed: 12/17/2022]
Abstract
In a cross-sectional stepped wedge cluster randomized trial (SWT), clusters are randomized to crossover from control to intervention at different time periods and outcomes are assessed for a different set of individuals in each cluster-period. Randomization-based inference is an attractive analysis strategy for SWTs because it does not require full parametric specification of the outcome distribution or correlation structure and its validity does not rely on having a large number of clusters. Existing randomization-based approaches for SWTs, however, either focus on hypothesis testing and omit technical details on confidence interval (CI) calculation with noncontinuous outcomes, or employ weighted cluster-period summary statistics for p-value and CI calculation, which can result in suboptimal efficiency if weights do not incorporate information on varying cluster-period sizes. In this article, we propose a framework for calculating randomization-based p-values and CIs for a marginal treatment effect in SWTs by using test statistics derived from individual-level generalized linear models. We also investigate how study design features, such as stratified randomization, subsequently impact various SWT analysis methods including the proposed approach. Data from the XpertMTB/RIF tuberculosis trial are reanalyzed to illustrate our method and compare it to alternatives.
Collapse
Affiliation(s)
- Dustin J. Rabideau
- Biostatistics Center, Massachusetts General Hospital, Boston, Massachusetts
- Department of Medicine, Harvard Medical School, Boston, Massachusetts
| | - Rui Wang
- Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, Massachusetts
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts
| |
Collapse
|
49
|
Xia F, Hughes JP, Voldal EC, Heagerty PJ. Power and sample size calculation for stepped-wedge designs with discrete outcomes. Trials 2021; 22:598. [PMID: 34488848 PMCID: PMC8419932 DOI: 10.1186/s13063-021-05542-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Accepted: 08/13/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Stepped-wedge designs (SWD) are increasingly used to evaluate the impact of changes to the process of care within health care systems. However, to generate definitive evidence, a correct sample size calculation is crucial to ensure such studies are properly powered. The seminal work of Hussey and Hughes (Contemp Clin Trials 28(2):182-91, 2004) provides an analytical formula for power calculations with normal outcomes using a linear model and simple random effects. However, minimal development and evaluation have been done for power calculation with non-normal outcomes on their natural scale (e.g., logit, log). For example, binary endpoints are common, and logistic regression is the natural multilevel model for such clustered data. METHODS We propose a power calculation formula for SWD with either normal or non-normal outcomes in the context of generalized linear mixed models by adopting the Laplace approximation detailed in Breslow and Clayton (J Am Stat Assoc 88(421):9-25, 1993) to obtain the covariance matrix of the estimated parameters. RESULTS We compare the performance of our proposed method with simulation-based sample size calculation and demonstrate its use on a study of patient-delivered partner therapy for STI treatment and a study that assesses the impact of providing additional benchmark prevalence information in a radiologic imaging report. To facilitate adoption of our methods we also provide a function embedded in the R package "swCRTdesign" for sample size and power calculation for multilevel stepped-wedge designs. CONCLUSIONS Our method requires minimal computational power. Therefore, the proposed procedure facilitates rapid dynamic updates of sample size calculations and can be used to explore a wide range of design options or assumptions.
Collapse
Affiliation(s)
- Fan Xia
- National Alzheimer's Coordinating Center, University of Washington, Seattle, WA, USA.
| | - James P Hughes
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Emily C Voldal
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | | |
Collapse
|
50
|
RAFF-3 Trial: A Stepped-Wedge Cluster Randomized Trial to Improve Care of Acute Atrial Fibrillation and Flutter in the Emergency Department. Can J Cardiol 2021; 37:1569-1577. [PMID: 34217808 DOI: 10.1016/j.cjca.2021.06.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Revised: 06/21/2021] [Accepted: 06/23/2021] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND We sought to improve care of patients with acute atrial fibrillation (AF) and flutter (AFL) in the emergency department (ED) by implementing the CAEP AAFF Best Practice Checklist. METHODS We conducted a stepped-wedge cluster randomized trial at 11 large community and academic hospital EDs, in five Canadian provinces and enrolled consecutive AF/AFL patients. The study intervention was the introduction of the CAEP Checklist using a knowledge translation-implementation approach that included behavior change techniques and organization/system level strategies. The primary outcome was length of stay in ED and secondary outcomes were discharge home, use of rhythm control, adverse events, and 30-day status. Analysis used mixed effects regression adjusting for covariates. RESULTS Patient visits in the control (N=314) and intervention (N=404) periods were similar with mean age 62.9, 54% male, 71% onset <12 hours, and 86% atrial fibrillation, 14% atrial flutter. We observed a reduction in length of stay of 20.9% (95% CI 5.5 to 33.8%, P=0.01), an increase in use of rhythm control (adjusted odds ratio (OR 4.5, 1.8-11.6; P=0.002), and decrease in use of rate control medications (OR 0.5, 0.2 to 0.9; P=0.02). There was no change in adverse events and no strokes or deaths by 30 days. CONCLUSIONS The RAFF-3 Trial led to optimized care of AF/AFL patients with decreased ED lengths of stay, increased ED rhythm control by drug or electricity, and no increase in adverse events. Early cardioversion allows AF/AFL patients to quickly resume normal activities. CLINICALTRIALS. GOV IDENTIFIER NCT03627143.
Collapse
|