1
|
Westgate PM, Nigam SR, Shoben AB. Reconsidering stepped wedge cluster randomized trial designs with implementation periods: Fewer sequences or the parallel-group design with baseline and implementation periods are potentially more efficient. Clin Trials 2024; 21:710-722. [PMID: 38650332 PMCID: PMC11493850 DOI: 10.1177/17407745241244790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2024]
Abstract
BACKGROUND/AIMS When designing a cluster randomized trial, advantages and disadvantages of tentative designs must be weighed. The stepped wedge design is popular for multiple reasons, including its potential to increase power via improved efficiency relative to a parallel-group design. In many realistic settings, it will take time for clusters to fully implement the intervention. When designing the HEALing (Helping to End Addiction Long-termSM) Communities Study, implementation time was a major consideration, and we examined the efficiency and practicality of three designs. Specifically, a three-sequence stepped wedge design with implementation periods, a corresponding two-sequence modified design that is created by removing the middle sequence, and a parallel-group design with baseline and implementation periods. In this article, we study the relative efficiencies of these specific designs. More generally, we study the relative efficiencies of modified designs when the stepped wedge design with implementation periods has three or more sequences. We also consider different correlation structures. METHODS We compare efficiencies of stepped wedge designs with implementation periods consisting of three to nine sequences with a variety of corresponding designs. The three-sequence design is compared to the two-sequence modified design and to the parallel-group design with baseline and implementation periods analysed via analysis of covariance. Stepped wedge designs with implementation periods consisting of four or more sequences are compared to modified designs that remove all or a subset of 'middle' sequences. Efficiencies are based on the use of linear mixed effects models. RESULTS In the studied settings, the modified design is more efficient than the three-sequence stepped wedge design with implementation periods. The parallel-group design with baseline and implementation periods with analysis of covariance-based analysis is often more efficient than the three-sequence design. With respect to stepped wedge designs with implementation periods that are comprised of more sequences, there are often corresponding modified designs that improve efficiency. However, use of only the first and last sequences has the potential to be either relatively efficient or inefficient. Relative efficiency is impacted by the strength of the statistical correlation among outcomes from the same cluster; for example, the relative efficiencies of modified designs tend to be greater for smaller cluster auto-correlation values. CONCLUSION If a three-sequence stepped wedge design with implementation periods is being considered for a future cluster randomized trial, then a corresponding modified design using only the first and last sequences should be considered if sole focus is on efficiency. However, a parallel-group design with baseline and implementation periods and analysis of covariance-based analysis can be a practical, efficient alternative. For stepped wedge designs with implementation periods and a larger number of sequences, modified versions that remove 'middle' sequences should be considered. Due to the potential sensitivity of design efficiencies, statistical correlation should be carefully considered.
Collapse
Affiliation(s)
- Philip M Westgate
- Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, USA
| | - Shawn R Nigam
- Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, USA
| | - Abigail B Shoben
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
2
|
Westgate PM, Cheng DM, Feaster DJ, Fernández S, Shoben AB, Vandergrift N. Marginal modeling in community randomized trials with rare events: Utilization of the negative binomial regression model. Clin Trials 2022; 19:162-171. [PMID: 34991359 PMCID: PMC9038610 DOI: 10.1177/17407745211063479] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
BACKGROUND/AIMS This work is motivated by the HEALing Communities Study, which is a post-test only cluster randomized trial in which communities are randomized to two different trial arms. The primary interest is in reducing opioid overdose fatalities, which will be collected as a count outcome at the community level. Communities range in size from thousands to over one million residents, and fatalities are expected to be rare. Traditional marginal modeling approaches in the cluster randomized trial literature include the use of generalized estimating equations with an exchangeable correlation structure when utilizing subject-level data, or analogously quasi-likelihood based on an over-dispersed binomial variance when utilizing community-level data. These approaches account for and estimate the intra-cluster correlation coefficient, which should be provided in the results from a cluster randomized trial. Alternatively, the coefficient of variation or R coefficient could be reported. In this article, we show that negative binomial regression can also be utilized when communities are large and events are rare. The objectives of this article are (1) to show that the negative binomial regression approach targets the same marginal regression parameter(s) as an over-dispersed binomial model and to explain why the estimates may differ; (2) to derive formulas relating the negative binomial overdispersion parameter k with the intra-cluster correlation coefficient, coefficient of variation, and R coefficient; and (3) analyze pre-intervention data from the HEALing Communities Study to demonstrate and contrast models and to show how to report the intra-cluster correlation coefficient, coefficient of variation, and R coefficient when utilizing negative binomial regression. METHODS Negative binomial and over-dispersed binomial regression modeling are contrasted in terms of model setup, regression parameter estimation, and formulation of the overdispersion parameter. Three specific models are used to illustrate concepts and address the third objective. RESULTS The negative binomial regression approach targets the same marginal regression parameter(s) as an over-dispersed binomial model, although estimates may differ. Practical differences arise in regard to how overdispersion, and hence the intra-cluster correlation coefficient is modeled. The negative binomial overdispersion parameter is approximately equal to the ratio of the intra-cluster correlation coefficient and marginal probability, the square of the coefficient of variation, and the R coefficient minus 1. As a result, estimates corresponding to all four of these different types of overdispersion parameterizations can be reported when utilizing negative binomial regression. CONCLUSION Negative binomial regression provides a valid, practical, alternative approach to the analysis of count data, and corresponding reporting of overdispersion parameters, from community randomized trials in which communities are large and events are rare.
Collapse
Affiliation(s)
- Philip M Westgate
- Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, USA
| | - Debbie M Cheng
- Department of Biostatistics, School of Public Health, Boston University, Boston, MA, USA
| | - Daniel J Feaster
- Department of Public Health Sciences, Miller School of Medicine, University of Miami, Coral Gables, FL, USA
| | - Soledad Fernández
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Abigail B Shoben
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, USA
| | | |
Collapse
|
3
|
Babić Ž, Kovačić J, Franić Z, Šakić F, Prester L, Varnai VM, Cvijetić Avdagić S, Bjelajac A, Macan J, Turk R. Prevention of poisonings by educational intervention aimed at parents of preschool children. Int J Inj Contr Saf Promot 2021; 28:486-493. [PMID: 34551681 DOI: 10.1080/17457300.2021.1955936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
The aim of the study was to assess the effectiveness of the specific design of a poisoning prevention intervention. This controlled before-after study followed Solomon design for educational interventions using two groups (the educational intervention group and the control group). Participants comprised parents of children attending kindergartens under the jurisdiction of the City of Zagreb and in the vicinity of Zagreb. The intervention group (N = 336) underwent an educational intervention during parents' meetings comprising oral presentation by the Croatian Poison Control Centre (CPCC) and distribution of gift packages containing child-proof locks, flyers, and stickers with the CPCC contact number. After the intervention they more frequently started keeping the CPCC's number by their telephone or in the list of important numbers than parents in the control group, and this association remained significant when tested by generalized estimating equations for binary outcomes, after the adjustment for parents' characteristics (age, gender and educational level), and clustered by kindergartens (p < 0.001). This means parents acknowledged the CPCC as an adequate and accessible way for initial management of poisoning incidents.
Collapse
Affiliation(s)
- Željka Babić
- Poison Control Centre, Institute for Medical Research and Occupational Health, Zagreb, Croatia.,Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Jelena Kovačić
- Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Zrinka Franić
- Poison Control Centre, Institute for Medical Research and Occupational Health, Zagreb, Croatia.,Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Franka Šakić
- Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Ljerka Prester
- Poison Control Centre, Institute for Medical Research and Occupational Health, Zagreb, Croatia.,Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Veda Marija Varnai
- Poison Control Centre, Institute for Medical Research and Occupational Health, Zagreb, Croatia.,Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Selma Cvijetić Avdagić
- Poison Control Centre, Institute for Medical Research and Occupational Health, Zagreb, Croatia.,Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Adrijana Bjelajac
- Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Jelena Macan
- Poison Control Centre, Institute for Medical Research and Occupational Health, Zagreb, Croatia.,Occupational Health and Environmental Medicine Unit, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| | - Rajka Turk
- Poison Control Centre, Institute for Medical Research and Occupational Health, Zagreb, Croatia
| |
Collapse
|
4
|
Chakraborty H, Solomon N, Anstrom KJ. A method to estimate intra-cluster correlation for clustered categorical data. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2021.1914660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Hrishikesh Chakraborty
- Department of Biostatistics & Bioinformatics, Duke University, Durham, NC, USA
- Duke Clinical Research Institute, Duke University, Durham, NC, USA
| | - Nicole Solomon
- Department of Biostatistics & Bioinformatics, Duke University, Durham, NC, USA
- Duke Clinical Research Institute, Duke University, Durham, NC, USA
| | - Kevin J Anstrom
- Department of Biostatistics & Bioinformatics, Duke University, Durham, NC, USA
- Duke Clinical Research Institute, Duke University, Durham, NC, USA
| |
Collapse
|
5
|
Westgate PM. A readily available improvement over method of moments for intra-cluster correlation estimation in the context of cluster randomized trials and fitting a GEE-type marginal model for binary outcomes. Clin Trials 2018; 16:41-51. [PMID: 30295512 DOI: 10.1177/1740774518803635] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
BACKGROUND/AIMS Cluster randomized trials are popular in health-related research due to the need or desire to randomize clusters of subjects to different trial arms as opposed to randomizing each subject individually. As outcomes from subjects within the same cluster tend to be more alike than outcomes from subjects within other clusters, an exchangeable correlation arises that is measured via the intra-cluster correlation coefficient. Intra-cluster correlation coefficient estimation is especially important due to the increasing awareness of the need to publish such values from studies in order to help guide the design of future cluster randomized trials. Therefore, numerous methods have been proposed to accurately estimate the intra-cluster correlation coefficient, with much attention given to binary outcomes. As marginal models are often of interest, we focus on intra-cluster correlation coefficient estimation in the context of fitting such a model with binary outcomes using generalized estimating equations. Traditionally, intra-cluster correlation coefficient estimation with generalized estimating equations has been based on the method of moments, although such estimators can be negatively biased. Furthermore, alternative estimators that work well, such as the analysis of variance estimator, are not as readily applicable in the context of practical data analyses with generalized estimating equations. Therefore, in this article we assess, in terms of bias, the readily available residual pseudo-likelihood approach to intra-cluster correlation coefficient estimation with the GLIMMIX procedure of SAS (SAS Institute, Cary, NC). Furthermore, we study a possible corresponding approach to confidence interval construction for the intra-cluster correlation coefficient. METHODS We utilize a simulation study and application example to assess bias in intra-cluster correlation coefficient estimates obtained from GLIMMIX using residual pseudo-likelihood. This estimator is contrasted with method of moments and analysis of variance estimators which are standards of comparison. The approach to confidence interval construction is assessed by examining coverage probabilities. RESULTS Overall, the residual pseudo-likelihood estimator performs very well. It has considerably less bias than moment estimators, which are its competitor for general generalized estimating equation-based analyses, and therefore, it is a major improvement in practice. Furthermore, it works almost as well as analysis of variance estimators when they are applicable. Confidence intervals have near-nominal coverage when the intra-cluster correlation coefficient estimate has negligible bias. CONCLUSION Our results show that the residual pseudo-likelihood estimator is a good option for intra-cluster correlation coefficient estimation when conducting a generalized estimating equation-based analysis of binary outcome data arising from cluster randomized trials. The estimator is practical in that it is simply a result from fitting a marginal model with GLIMMIX, and a confidence interval can be easily obtained. An additional advantage is that, unlike most other options for performing generalized estimating equation-based analyses, GLIMMIX provides analysts the option to utilize small-sample adjustments that ensure valid inference.
Collapse
Affiliation(s)
- Philip M Westgate
- Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, USA
| |
Collapse
|
6
|
Chakraborty H, Hossain A. R package to estimate intracluster correlation coefficient with confidence interval for binary data. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2018; 155:85-92. [PMID: 29512507 DOI: 10.1016/j.cmpb.2017.10.023] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Revised: 09/21/2017] [Accepted: 10/28/2017] [Indexed: 05/26/2023]
Abstract
BACKGROUND AND OBJECTIVE The Intracluster Correlation Coefficient (ICC) is a major parameter of interest in cluster randomized trials that measures the degree to which responses within the same cluster are correlated. There are several types of ICC estimators and its confidence intervals (CI) suggested in the literature for binary data. Studies have compared relative weaknesses and advantages of ICC estimators as well as its CI for binary data and suggested situations where one is advantageous in practical research. The commonly used statistical computing systems currently facilitate estimation of only a very few variants of ICC and its CI. To address the limitations of current statistical packages, we developed an R package, ICCbin, to facilitate estimating ICC and its CI for binary responses using different methods. METHODS The ICCbin package is designed to provide estimates of ICC in 16 different ways including analysis of variance methods, moments based estimation, direct probabilistic methods, correlation based estimation, and resampling method. CI of ICC is estimated using 5 different methods. It also generates cluster binary data using exchangeable correlation structure. RESULTS ICCbin package provides two functions for users. The function rcbin() generates cluster binary data and the function iccbin() estimates ICC and it's CI. The users can choose appropriate ICC and its CI estimate from the wide selection of estimates from the outputs. CONCLUSIONS The R package ICCbin presents very flexible and easy to use ways to generate cluster binary data and to estimate ICC and it's CI for binary response using different methods. The package ICCbin is freely available for use with R from the CRAN repository (https://cran.r-project.org/package=ICCbin). We believe that this package can be a very useful tool for researchers to design cluster randomized trials with binary outcome.
Collapse
Affiliation(s)
| | - Akhtar Hossain
- Department of Epidemiology and Biostatistics, Arnold School of Public Health, University of South Carolina, Columbia, SC, USA.
| |
Collapse
|