1
|
Wang X, Turner EL, Li F. Designing individually randomized group treatment trials with repeated outcome measurements using generalized estimating equations. Stat Med 2024; 43:358-378. [PMID: 38009329 PMCID: PMC10939061 DOI: 10.1002/sim.9966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Revised: 11/04/2023] [Accepted: 11/08/2023] [Indexed: 11/28/2023]
Abstract
Individually randomized group treatment (IRGT) trials, in which the clustering of outcome is induced by group-based treatment delivery, are increasingly popular in public health research. IRGT trials frequently incorporate longitudinal measurements, of which the proper sample size calculations should account for correlation structures reflecting both the treatment-induced clustering and repeated outcome measurements. Given the relatively sparse literature on designing longitudinal IRGT trials, we propose sample size procedures for continuous and binary outcomes based on the generalized estimating equations approach, employing the block exchangeable correlation structures with different correlation parameters for the treatment arm and for the control arm, and surveying five marginal mean models with different assumptions of time effect: no-time constant treatment effect, linear-time constant treatment effect, categorical-time constant treatment effect, linear time by treatment interaction, and categorical time by treatment interaction. Closed-form sample size formulas are derived for continuous outcomes, which depends on the eigenvalues of the correlation matrices; detailed numerical sample size procedures are proposed for binary outcomes. Through simulations, we demonstrate that the empirical power agrees well with the predicted power, for as few as eight groups formed in the treatment arm, when data are analyzed using the matrix-adjusted estimating equations for the correlation parameters with a bias-corrected sandwich variance estimator.
Collapse
Affiliation(s)
- Xueqi Wang
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, 06511, USA
| | - Elizabeth L. Turner
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, 27710, USA
- Duke Global Health Institute, Duke University, Durham, NC, 27710, USA
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, 06511, USA
- Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, 06511, USA
| |
Collapse
|
2
|
Zhu AY, Mitra N, Hemming K, Harhay MO, Li F. Leveraging baseline covariates to analyze small cluster-randomized trials with a rare binary outcome. Biom J 2024; 66:e2200135. [PMID: 37035941 PMCID: PMC10562517 DOI: 10.1002/bimj.202200135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 11/20/2022] [Accepted: 02/08/2023] [Indexed: 04/11/2023]
Abstract
Cluster-randomized trials (CRTs) involve randomizing entire groups of participants-called clusters-to treatment arms but are often comprised of a limited or fixed number of available clusters. While covariate adjustment can account for chance imbalances between treatment arms and increase statistical efficiency in individually randomized trials, analytical methods for individual-level covariate adjustment in small CRTs have received little attention to date. In this paper, we systematically investigate, through extensive simulations, the operating characteristics of propensity score weighting and multivariable regression as two individual-level covariate adjustment strategies for estimating the participant-average causal effect in small CRTs with a rare binary outcome and identify scenarios where each adjustment strategy has a relative efficiency advantage over the other to make practical recommendations. We also examine the finite-sample performance of the bias-corrected sandwich variance estimators associated with propensity score weighting and multivariable regression for quantifying the uncertainty in estimating the participant-average treatment effect. To illustrate the methods for individual-level covariate adjustment, we reanalyze a recent CRT testing a sedation protocol in 31 pediatric intensive care units.
Collapse
Affiliation(s)
- Angela Y. Zhu
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, United States of America
| | - Nandita Mitra
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, United States of America
| | - Karla Hemming
- Department of Public Health, Epidemiology, and Biostatistics, University of Birmingham Institute of Applied Health Research, Birmingham B15 2TT, United Kingdom
| | - Michael O. Harhay
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, United States of America
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, United States of America
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, CT 06510, United States of America
| |
Collapse
|
3
|
Kahan BC, Li F, Blette B, Jairath V, Copas A, Harhay M. Informative cluster size in cluster-randomised trials: A case study from the TRIGGER trial. Clin Trials 2023; 20:661-669. [PMID: 37439089 PMCID: PMC10638852 DOI: 10.1177/17407745231186094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/14/2023]
Abstract
BACKGROUND Recent work has shown that cluster-randomised trials can estimate two distinct estimands: the participant-average and cluster-average treatment effects. These can differ when participant outcomes or the treatment effect depends on the cluster size (termed informative cluster size). In this case, estimators that target one estimand (such as the analysis of unweighted cluster-level summaries, which targets the cluster-average effect) may be biased for the other. Furthermore, commonly used estimators such as mixed-effects models or generalised estimating equations with an exchangeable correlation structure can be biased for both estimands. However, there has been little empirical research into whether informative cluster size is likely to occur in practice. METHOD We re-analysed a cluster-randomised trial comparing two different thresholds for red blood cell transfusion in patients with acute upper gastrointestinal bleeding to explore whether estimates for the participant- and cluster-average effects differed, to provide empirical evidence for whether informative cluster size may be present. For each outcome, we first estimated a participant-average effect using independence estimating equations, which are unbiased under informative cluster size. We then compared this to two further methods: (1) a cluster-average effect estimated using either weighted independence estimating equations or unweighted cluster-level summaries, and (2) estimates from a mixed-effects model or generalised estimating equations with an exchangeable correlation structure. We then performed a small simulation study to evaluate whether observed differences between cluster- and participant-average estimates were likely to occur even if no informative cluster size was present. RESULTS For most outcomes, treatment effect estimates from different methods were similar. However, differences of >10% occurred between participant- and cluster-average estimates for 5 of 17 outcomes (29%). We also observed several notable differences between estimates from mixed-effects models or generalised estimating equations with an exchangeable correlation structure and those based on independence estimating equations. For example, for the EQ-5D VAS score, the independence estimating equation estimate of the participant-average difference was 4.15 (95% confidence interval: -3.37 to 11.66), compared with 2.84 (95% confidence interval: -7.37 to 13.04) for the cluster-average independence estimating equation estimate, and 3.23 (95% confidence interval: -6.70 to 13.16) from a mixed-effects model. Similarly, for thromboembolic/ischaemic events, the independence estimating equation estimate for the participant-average odds ratio was 0.43 (95% confidence interval: 0.07 to 2.48), compared with 0.33 (95% confidence interval: 0.06 to 1.77) from the cluster-average estimator. CONCLUSION In this re-analysis, we found that estimates from the various approaches could differ, which may be due to the presence of informative cluster size. Careful consideration of the estimand and the plausibility of assumptions underpinning each estimator can help ensure an appropriate analysis methods are used. Independence estimating equations and the analysis of cluster-level summaries (with appropriate weighting for each to correspond to either the participant-average or cluster-average treatment effect) are a desirable choice when informative cluster size is deemed possible, due to their unbiasedness in this setting.
Collapse
Affiliation(s)
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, Yale University, New Haven, CT, USA
| | - Bryan Blette
- Department of Biostatistics, Epidemiology & Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Vipul Jairath
- Division of Gastroenterology, Department of Medicine, Schulich School of Medicine & Dentistry, Western University, London, ON, Canada
- Department of Epidemiology and Biostatistics, Western University, London, ON, Canada
| | | | - Michael Harhay
- Department of Biostatistics, Epidemiology & Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
4
|
Tong G, Taljaard M, Li F. Sample size considerations for assessing treatment effect heterogeneity in randomized trials with heterogeneous intracluster correlations and variances. Stat Med 2023; 42:3392-3412. [PMID: 37316956 DOI: 10.1002/sim.9811] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2022] [Revised: 02/08/2023] [Accepted: 05/19/2023] [Indexed: 06/16/2023]
Abstract
An important consideration in the design and analysis of randomized trials is the need to account for outcome observations being positively correlated within groups or clusters. Two notable types of designs with this consideration are individually randomized group treatment trials and cluster randomized trials. While sample size methods for testing the average treatment effect are available for both types of designs, methods for detecting treatment effect modification are relatively limited. In this article, we present new sample size formulas for testing treatment effect modification based on either a univariate or multivariate effect modifier in both individually randomized group treatment and cluster randomized trials with a continuous outcome but any types of effect modifier, while accounting for differences across study arms in the outcome variance, outcome intracluster correlation coefficient (ICC) and the cluster size. We consider cases where the effect modifier can be measured at either the individual level or cluster level, and with a univariate effect modifier, our closed-form sample size expressions provide insights into the optimal allocation of groups or clusters to maximize design efficiency. Overall, our results show that the required sample size for testing treatment effect heterogeneity with an individual-level effect modifier can be affected by unequal ICCs and variances between arms, and accounting for such between-arm heterogeneity can lead to more accurate sample size determination. We use simulations to validate our sample size formulas and illustrate their application in the context of two real trials: an individually randomized group treatment trial (the AWARE study) and a cluster randomized trial (the K-DPP study).
Collapse
Affiliation(s)
- Guangyu Tong
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
- School of Epidemiology and Public Heath, University of Ottawa, Ottawa, Canada
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, Connecticut, USA
| |
Collapse
|
5
|
Wang X, Turner EL, Li F. Improving sandwich variance estimation for marginal Cox analysis of cluster randomized trials. Biom J 2023; 65:e2200113. [PMID: 36567265 PMCID: PMC10482495 DOI: 10.1002/bimj.202200113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 08/31/2022] [Accepted: 10/29/2022] [Indexed: 12/27/2022]
Abstract
Cluster randomized trials (CRTs) frequently recruit a small number of clusters, therefore necessitating the application of small-sample corrections for valid inference. A recent systematic review indicated that CRTs reporting right-censored, time-to-event outcomes are not uncommon and that the marginal Cox proportional hazards model is one of the common approaches used for primary analysis. While small-sample corrections have been studied under marginal models with continuous, binary, and count outcomes, no prior research has been devoted to the development and evaluation of bias-corrected sandwich variance estimators when clustered time-to-event outcomes are analyzed by the marginal Cox model. To improve current practice, we propose nine bias-corrected sandwich variance estimators for the analysis of CRTs using the marginal Cox model and report on a simulation study to evaluate their small-sample properties. Our results indicate that the optimal choice of bias-corrected sandwich variance estimator for CRTs with survival outcomes can depend on the variability of cluster sizes and can also slightly differ whether it is evaluated according to relative bias or type I error rate. Finally, we illustrate the new variance estimators in a real-world CRT where the conclusion about intervention effectiveness differs depending on the use of small-sample bias corrections. The proposed sandwich variance estimators are implemented in an R package CoxBcv.
Collapse
Affiliation(s)
- Xueqi Wang
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, 27710, USA
- Duke Global Health Institute, Durham, NC, 27710, USA
| | - Elizabeth L. Turner
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, 27710, USA
- Duke Global Health Institute, Durham, NC, 27710, USA
| | - Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, 06511, USA
- Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, 06511, USA
| |
Collapse
|
6
|
Kahan BC, Li F, Copas AJ, Harhay MO. Estimands in cluster-randomized trials: choosing analyses that answer the right question. Int J Epidemiol 2023; 52:107-118. [PMID: 35834775 PMCID: PMC9908044 DOI: 10.1093/ije/dyac131] [Citation(s) in RCA: 32] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 06/07/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Cluster-randomized trials (CRTs) involve randomizing groups of individuals (e.g. hospitals, schools or villages) to different interventions. Various approaches exist for analysing CRTs but there has been little discussion around the treatment effects (estimands) targeted by each. METHODS We describe the different estimands that can be addressed through CRTs and demonstrate how choices between different analytic approaches can impact the interpretation of results by fundamentally changing the question being asked, or, equivalently, the target estimand. RESULTS CRTs can address either the participant-average treatment effect (the average treatment effect across participants) or the cluster-average treatment effect (the average treatment effect across clusters). These two estimands can differ when participant outcomes or the treatment effect depends on the cluster size (referred to as 'informative cluster size'), which can occur for reasons such as differences in staffing levels or types of participants between small and large clusters. Furthermore, common estimators, such as mixed-effects models or generalized estimating equations with an exchangeable working correlation structure, can produce biased estimates for both the participant-average and cluster-average treatment effects when cluster size is informative. We describe alternative estimators (independence estimating equations and cluster-level analyses) that are unbiased for CRTs even when informative cluster size is present. CONCLUSION We conclude that careful specification of the estimand at the outset can ensure that the study question being addressed is clear and relevant, and, in turn, that the selected estimator provides an unbiased estimate of the desired quantity.
Collapse
Affiliation(s)
- Brennan C Kahan
- MRC Clinical Trials Unit at UCL, Institute of Clinical Trials and Methodology, London, UK
| | - Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Prevention Science, Yale University School of Public Health, New Haven, CT, USA
| | - Andrew J Copas
- MRC Clinical Trials Unit at UCL, Institute of Clinical Trials and Methodology, London, UK
| | - Michael O Harhay
- Clinical Trials Methods and Outcomes Lab, PAIR (Palliative and Advanced Illness Research) Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
7
|
Blaha O, Esserman D, Li F. Design and analysis of cluster randomized trials with time-to-event outcomes under the additive hazards mixed model. Stat Med 2022; 41:4860-4885. [PMID: 35908796 PMCID: PMC9588628 DOI: 10.1002/sim.9541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2021] [Revised: 05/04/2022] [Accepted: 07/19/2022] [Indexed: 11/12/2022]
Abstract
A primary focus of current methods for cluster randomized trials (CRTs) has been for continuous, binary, and count outcomes, with relatively less attention given to right-censored, time-to-event outcomes. In this article, we detail considerations for sample size requirement and statistical inference in CRTs with time-to-event outcomes when the intervention effect parameter is specified through the additive hazards mixed model (AHMM), which includes a frailty term to explicitly account for the dependency between the failure times. First, we discuss improved inference for the treatment effect parameter via bias-corrected sandwich variance estimators and randomization-based test under AHMM, addressing potential small-sample biases in CRTs. Next, we derive a new sample size formula for AHMM analysis of CRTs accommodating both equal and unequal cluster sizes. When the cluster sizes vary, our sample size formula depends on the mean and coefficient of variation of cluster sizes, based on which we articulate the impact of cluster size variation in CRTs with time-to-event outcomes. Furthermore, we obtain the insight that the classical variance inflation factor for CRTs with a non-censored outcome can in fact apply to CRTs with a time-to-event outcome, providing that an appropriate definition of the intraclass correlation coefficient is considered under AHMM. Simulation studies are carried out to illustrate key design and analysis considerations in CRTs with a small to moderate number of clusters. The proposed sample size procedure and analytical methods are further illustrated using the context of the STrategies to Reduce Injuries and Develop Confidence in Elders CRT.
Collapse
Affiliation(s)
- Ondrej Blaha
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA
- Yale Center for Analytical Sciences, Yale University School of Public Health, New Haven, Connecticut, USA
| | - Denise Esserman
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA
- Yale Center for Analytical Sciences, Yale University School of Public Health, New Haven, Connecticut, USA
| | - Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA
- Yale Center for Analytical Sciences, Yale University School of Public Health, New Haven, Connecticut, USA
- Center for Methods in Implementation and Prevention Science, Yale University School of Public Health, New Haven, Connecticut, USA
| |
Collapse
|
8
|
Chen X, Harhay MO, Li F. Clustered restricted mean survival time regression. Biom J 2022. [PMID: 35593026 DOI: 10.1002/bimj.202200002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 03/23/2022] [Accepted: 04/18/2022] [Indexed: 11/05/2022]
Abstract
For multicenter randomized trials or multilevel observational studies, the Cox regression model has long been the primary approach to study the effects of covariates on time-to-event outcomes. A critical assumption of the Cox model is the proportionality of the hazard functions for modeled covariates, violations of which can result in ambiguous interpretations of the hazard ratio estimates. To address this issue, the restricted mean survival time (RMST), defined as the mean survival time up to a fixed time in a target population, has been recommended as a model-free target parameter. In this article, we generalize the RMST regression model to clustered data by directly modeling the RMST as a continuous function of restriction times with covariates while properly accounting for within-cluster correlations to achieve valid inference. The proposed method estimates regression coefficients via weighted generalized estimating equations, coupled with a cluster-robust sandwich variance estimator to achieve asymptotically valid inference with a sufficient number of clusters. In small-sample scenarios where a limited number of clusters are available, however, the proposed sandwich variance estimator can exhibit negative bias in capturing the variability of regression coefficient estimates. To overcome this limitation, we further propose and examine bias-corrected sandwich variance estimators to reduce the negative bias of the cluster-robust sandwich variance estimator. We study the finite-sample operating characteristics of proposed methods through simulations and reanalyze two multicenter randomized trials.
Collapse
Affiliation(s)
- Xinyuan Chen
- Department of Mathematics and Statistics, Mississippi State University, Mississippi State, MS, USA
| | - Michael O Harhay
- Clinical Trials Methods and Outcomes Lab, PAIR (Palliative and Advanced Illness Research) Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.,Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA.,Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, USA
| |
Collapse
|
9
|
Wang X, Turner EL, Preisser JS, Li F. Power considerations for generalized estimating equations analyses of four-level cluster randomized trials. Biom J 2022; 64:663-680. [PMID: 34897793 PMCID: PMC9574475 DOI: 10.1002/bimj.202100081] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2021] [Revised: 09/01/2021] [Accepted: 09/06/2021] [Indexed: 01/10/2023]
Abstract
In this article, we develop methods for sample size and power calculations in four-level intervention studies when intervention assignment is carried out at any level, with a particular focus on cluster randomized trials (CRTs). CRTs involving four levels are becoming popular in healthcare research, where the effects are measured, for example, from evaluations (level 1) within participants (level 2) in divisions (level 3) that are nested in clusters (level 4). In such multilevel CRTs, we consider three types of intraclass correlations between different evaluations to account for such clustering: that of the same participant, that of different participants from the same division, and that of different participants from different divisions in the same cluster. Assuming arbitrary link and variance functions, with the proposed correlation structure as the true correlation structure, closed-form sample size formulas for randomization carried out at any level (including individually randomized trials within a four-level clustered structure) are derived based on the generalized estimating equations approach using the model-based variance and using the sandwich variance with an independence working correlation matrix. We demonstrate that empirical power corresponds well with that predicted by the proposed method for as few as eight clusters, when data are analyzed using the matrix-adjusted estimating equations for the correlation parameters with a bias-corrected sandwich variance estimator, under both balanced and unbalanced designs.
Collapse
Affiliation(s)
- Xueqi Wang
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, 27707, USA
- Duke Global Health Institute, Durham, NC, 27707, USA
| | - Elizabeth L. Turner
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, 27707, USA
- Duke Global Health Institute, Durham, NC, 27707, USA
| | - John S. Preisser
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, 06511, USA
- Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, 06511, USA
| |
Collapse
|
10
|
Chen X, Li F. Finite-sample adjustments in variance estimators for clustered competing risks regression. Stat Med 2022; 41:2645-2664. [PMID: 35288959 DOI: 10.1002/sim.9375] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 01/23/2022] [Accepted: 02/23/2022] [Indexed: 12/19/2022]
Abstract
The marginal Fine-Gray proportional subdistribution hazards model is a popular approach to directly study the association between covariates and the cumulative incidence function with clustered competing risks data, which often arise in multicenter randomized trials or multilevel observational studies. To account for the within-cluster correlations between failure times, the uncertainty of the regression parameters estimators is quantified by the robust sandwich variance estimator, which may have unsatisfactory performance with a limited number of clusters. To overcome this limitation, we propose four bias-corrected variance estimators to reduce the negative bias of the usual sandwich variance estimator, extending the bias-correction techniques from generalized estimating equations with noncensored exponential family outcomes to clustered competing risks outcomes. We further compare their finite-sample operating characteristics through simulations and two real data examples. In particular, we found the Mancl and DeRouen (MD) type sandwich variance estimator generally has the smallest bias. Furthermore, with a small number of clusters, the Wald t -confidence interval with the MD sandwich variance estimator carries close to nominal coverage for the cluster-level effect parameter. The t -confidence intervals based on the sandwich variance estimator with any one of the three types of multiplicative bias correction or the z -confidence interval with the Morel, Bokossa and Neerchal (MBN) type sandwich variance estimator have close to nominal coverage for the individual-level effect parameter. Finally, we develop a user-friendly R package crrcbcv implementing the proposed sandwich variance estimators to assist practical applications.
Collapse
Affiliation(s)
- Xinyuan Chen
- Department of Mathematics and Statistics, Mississippi State University, Starkville, Mississippi, USA
| | - Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA.,Center for Methods in Implementation and Prevention Science, Yale University School of Public Health, New Haven, Connecticut, USA.,Yale Center for Analytical Sciences, Yale University School of Public Health, New Haven, Connecticut, USA
| |
Collapse
|
11
|
Tong G, Esserman D, Li F. Accounting for unequal cluster sizes in designing cluster randomized trials to detect treatment effect heterogeneity. Stat Med 2021; 41:1376-1396. [PMID: 34923655 DOI: 10.1002/sim.9283] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 11/14/2021] [Accepted: 11/24/2021] [Indexed: 12/26/2022]
Abstract
Unequal cluster sizes are common in cluster randomized trials (CRTs). While there are a number of previous investigations studying the impact of unequal cluster sizes on the power for testing the average treatment effect in CRTs, little is known about the impact of unequal cluster sizes on the power for testing the heterogeneous treatment effect (HTE) in CRTs. In this work, we expand the sample size procedures for studying HTE in CRTs to accommodate cluster size variation under the linear mixed model framework. Through analytical derivation and graphical exploration, we show that the sample size for the HTE with an individual-level effect modifier is less affected by unequal cluster sizes than with a cluster-level effect modifier. The impact of cluster size variability jointly depends on the mean and coefficient of variation of cluster sizes, covariate intraclass correlation coefficient (ICC) and the conditional outcome ICC. In addition, we demonstrate that the HTE-motivated analysis of covariance framework can be used for analyzing the average treatment effect, and offer a more efficient sample size procedure for studying the average treatment effect adjusting for the effect modifier. We use simulations to confirm the accuracy of the proposed sample size procedures for both the average treatment effect and HTE in CRTs. Extensions to multivariate effect modifiers are provided and our procedure is illustrated in the context of the Strategies to Reduce Injuries and Develop Confidence in Elders trial.
Collapse
Affiliation(s)
- Guangyu Tong
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA.,Yale Center for Analytical Sciences, Yale School of Public Health, New Haven, Connecticut, USA
| | - Denise Esserman
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA.,Yale Center for Analytical Sciences, Yale School of Public Health, New Haven, Connecticut, USA
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA.,Yale Center for Analytical Sciences, Yale School of Public Health, New Haven, Connecticut, USA.,Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, Connecticut, USA
| |
Collapse
|
12
|
Tian Z, Preisser JS, Esserman D, Turner EL, Rathouz PJ, Li F. Impact of unequal cluster sizes for GEE analyses of stepped wedge cluster randomized trials with binary outcomes. Biom J 2021; 64:419-439. [PMID: 34596912 PMCID: PMC9292617 DOI: 10.1002/bimj.202100112] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 07/15/2021] [Accepted: 08/07/2021] [Indexed: 12/31/2022]
Abstract
The stepped wedge (SW) design is a type of unidirectional crossover design where cluster units switch from control to intervention condition at different prespecified time points. While a convention in study planning is to assume the cluster‐period sizes are identical, SW cluster randomized trials (SW‐CRTs) involving repeated cross‐sectional designs frequently have unequal cluster‐period sizes, which can impact the efficiency of the treatment effect estimator. In this paper, we provide a comprehensive investigation of the efficiency impact of unequal cluster sizes for generalized estimating equation analyses of SW‐CRTs, with a focus on binary outcomes as in the Washington State Expedited Partner Therapy trial. Several major distinctions between our work and existing work include the following: (i) we consider multilevel correlation structures in marginal models with binary outcomes; (ii) we study the implications of both the between‐cluster and within‐cluster imbalances in sizes; and (iii) we provide a comparison between the independence working correlation versus the true working correlation and detail the consequences of ignoring correlation estimation in SW‐CRTs with unequal cluster sizes. We conclude that the working independence assumption can lead to substantial efficiency loss and a large sample size regardless of cluster‐period size variability in SW‐CRTs, and recommend accounting for correlations in the analysis. To improve study planning, we additionally provide a computationally efficient search algorithm to estimate the sample size in SW‐CRTs accounting for unequal cluster‐period sizes, and conclude by illustrating the proposed approach in the context of the Washington State study.
Collapse
Affiliation(s)
- Zibo Tian
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA
| | - John S Preisser
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Denise Esserman
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA.,Yale Center for Analytical Sciences, New Haven, CT, USA
| | - Elizabeth L Turner
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, USA.,Duke Global Health Institute, Durham, NC, USA
| | - Paul J Rathouz
- Department of Population Health, The University of Texas at Austin, Austin, TX, USA
| | - Fan Li
- Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA.,Yale Center for Analytical Sciences, New Haven, CT, USA.,Center for Methods in Implementation and Prevention Science, Yale University, New Haven, CT, USA
| |
Collapse
|