1
|
Jaspers S, Verbeeck J, Thas O. Covariate-adjusted generalized pairwise comparisons in small samples. Stat Med 2024; 43:4027-4042. [PMID: 38963080 DOI: 10.1002/sim.10140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Revised: 04/15/2024] [Accepted: 05/31/2024] [Indexed: 07/05/2024]
Abstract
Semiparametric probabilistic index models allow for the comparison of two groups of observations, whilst adjusting for covariates, thereby fitting nicely within the framework of generalized pairwise comparisons (GPC). As with most regression approaches in this setting, the limited amount of data results in invalid inference as the asymptotic normality assumption is not met. In addition, separation issues might arise when considering small samples. In this article, we show that the parameters of the probabilistic index model can be estimated using generalized estimating equations, for which adjustments exist that lead to estimators of the sandwich variance-covariance matrix with improved finite sample properties and that can deal with bias due to separation. In this way, appropriate inference can be performed as is shown through extensive simulation studies. The known relationships between the probabilistic index and other GPC statistics allow to also provide valid inference for example, the net treatment benefit or the success odds.
Collapse
Affiliation(s)
- Stijn Jaspers
- Data Science Institute and I-BioStat, Hasselt University, Diepenbeek, Belgium
| | - Johan Verbeeck
- Data Science Institute and I-BioStat, Hasselt University, Diepenbeek, Belgium
| | - Olivier Thas
- Data Science Institute and I-BioStat, Hasselt University, Diepenbeek, Belgium
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
- National Institute of Applied Statistics Research Australia (NIASRA), University of Wollongong, Wollongong, New South Wales, Australia
| |
Collapse
|
2
|
Hagiwara Y, Matsuyama Y. Goodness-of-fit tests for modified Poisson regression possibly producing fitted values exceeding one in binary outcome analysis. Stat Methods Med Res 2024; 33:1185-1196. [PMID: 38780488 DOI: 10.1177/09622802241254220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
Modified Poisson regression, which estimates the regression parameters in the log-binomial regression model using the Poisson quasi-likelihood estimating equation and robust variance, is a useful tool for estimating the adjusted risk and prevalence ratio in binary outcome analysis. Although several goodness-of-fit tests have been developed for other binary regressions, few goodness-of-fit tests are available for modified Poisson regression. In this study, we proposed several goodness-of-fit tests for modified Poisson regression, including the modified Hosmer-Lemeshow test with empirical variance, Tsiatis test, normalized Pearson chi-square tests with binomial variance and Poisson variance, and normalized residual sum of squares test. The original Hosmer-Lemeshow test and normalized Pearson chi-square test with binomial variance are inappropriate for the modified Poisson regression, which can produce a fitted value exceeding 1 owing to the unconstrained parameter space. A simulation study revealed that the normalized residual sum of squares test performed well regarding the type I error probability and the power for a wrong link function. We applied the proposed goodness-of-fit tests to the analysis of cross-sectional data of patients with cancer. We recommend the normalized residual sum of squares test as a goodness-of-fit test in the modified Poisson regression.
Collapse
Affiliation(s)
- Yasuhiro Hagiwara
- Department of Biostatistics, School of Public Health, The University of Tokyo, Japan
| | - Yutaka Matsuyama
- Department of Biostatistics, School of Public Health, The University of Tokyo, Japan
| |
Collapse
|
3
|
Gosho M, Ishii R, Noma H, Maruo K. A comparison of bias-adjusted generalized estimating equations for sparse binary data in small-sample longitudinal studies. Stat Med 2023. [PMID: 37062288 DOI: 10.1002/sim.9744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Revised: 12/28/2022] [Accepted: 04/02/2023] [Indexed: 04/18/2023]
Abstract
Using a generalized estimating equation (GEE) can lead to a bias in regression coefficients for a small sample or sparse data. The bias-corrected GEE (BCGEE) and penalized GEE (PGEE) were proposed to resolve the small-sample bias. Moreover, the standard sandwich covariance estimator leads to a bias of standard error for small samples; several modified covariance estimators have been proposed to address this issue. We review the modified GEEs and modified covariance estimators, and evaluate their performance in sparse binary data from small-sample longitudinal studies. The simulation results showed that GEE and BCGEE often failed to achieve convergence, whereas the convergence proportion for PGEE was quite high. The bias for the regression coefficients was generally in the ascending order of PGEE < $$ < $$ BCGEE < $$ < $$ GEE. However, PGEE and BCGEE did not sufficiently remove the bias involving 20-30 subjects with unequal exposure levels with a 5% response rate. The coverage probability (CP) of the confidence interval for BCGEE was relatively poor compared with GEE and PGEE. The CP with the sandwich covariance estimator deteriorated regardless of the GEE methods under the small sample size and low response rate, whereas the CP with the modified covariance estimators-such as Morel's method-was relatively acceptable. PGEE will be the reasonable way for analyzing sparse binary data in small-sample studies. Instead of using the standard sandwich covariance estimator, one should always apply the modified covariance estimators for analyzing these data.
Collapse
Affiliation(s)
- Masahiko Gosho
- Department of Biostatistics, Institute of Medicine, University of Tsukuba, Tennodai, Tsukuba, Japan
| | - Ryota Ishii
- Department of Biostatistics, Institute of Medicine, University of Tsukuba, Tennodai, Tsukuba, Japan
| | - Hisashi Noma
- Department of Data Science, The Institute of Statistical Mathematics, Tachikawa, Japan
| | - Kazushi Maruo
- Department of Biostatistics, Institute of Medicine, University of Tsukuba, Tennodai, Tsukuba, Japan
| |
Collapse
|
4
|
Cruz Gutierrez NA, Melo OO, Martinez CA. Semiparametric generalized estimating equations for repeated measurements in cross-over designs. Stat Methods Med Res 2023:9622802231158736. [PMID: 36919447 DOI: 10.1177/09622802231158736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
Abstract
A model for cross-over designs with repeated measures within each period was developed. It was obtained using an extension of generalized estimating equations that includes a parametric component to model treatment effects and a non-parametric component to model time and carry-over effects; the estimation approach for the non-parametric component is based on splines. A simulation study was carried out to explore the model properties. Thus, when there is a carry-over effect or a functional temporal effect, the proposed model presents better results than the standard models. Among the theoretical properties, the solution is found to be analogous to weighted least squares. Therefore, model diagnostics can be made by adapting the results from a multiple regression. The proposed methodology was implemented in the data sets of the cross-over experiments that motivated the approach of this work: systolic blood pressure and insulin in rabbits.
Collapse
Affiliation(s)
| | - Oscar Orlando Melo
- Departamento de Estadística, Facultad de Ciencias, 28021Universidad Nacional de Colombia, Mosquera, Colombia
| | - Carlos Alberto Martinez
- Corporaciòn Colombiana de Investigaciòn Agropecuaria - AGROSAVIA, Sede Central, Mosquera,Colombia
| |
Collapse
|
5
|
Martín-María N, Lara E, Cabello M, Olaya B, Haro JM, Miret M, Ayuso-Mateos JL. To be happy and behave in a healthier way. A longitudinal study about gender differences in the older population. Psychol Health 2023; 38:307-323. [PMID: 34353185 DOI: 10.1080/08870446.2021.1960988] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
INTRODUCTION Subjective well-being plays a key role in health. The objectives of this study are to analyse the longitudinal associations between subjective well-being dimensions and healthy behaviours, and to examine gender differences. METHOD A representative sample of 1,190 Spanish non-institutionalised adults aged 50+ were interviewed over a 6-year follow-up period. The Cantril scale was used to measure evaluative well-being. The Day Reconstruction Method measured experienced well-being. The Global Physical Activity Questionnaire was used, whereas fruit and vegetables, tobacco and alcohol consumption, and sleep quality were self-reported. The Generalised Estimating Equation was calculated. RESULTS Women show significantly worse subjective well-being than men longitudinally. Higher scores in life satisfaction and positive affect were significantly related to a higher level of physical activity and better-quality sleep for both women and men. Associations between a higher life satisfaction and an adequate intake of fruits and vegetables and being a non-smoker was only found in women (OR = 1.05; 95% IC = 1.00, 1.10 and OR = 1.16; 95% IC = 1.09, 1.23, respectively). CONCLUSION Subjective well-being levels and frequencies in healthy behaviours are different in women and men. Subjective well-being interventions should take into account these differences in the frequency of healthy-unhealthy behaviours.
Collapse
Affiliation(s)
- Natalia Martín-María
- Department of Psychiatry, Universidad Autónoma de Madrid, Spain.,Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Instituto de Salud Carlos III, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| | - Elvira Lara
- Department of Psychiatry, Universidad Autónoma de Madrid, Spain.,Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Instituto de Salud Carlos III, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| | - María Cabello
- Department of Psychiatry, Universidad Autónoma de Madrid, Spain.,Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Instituto de Salud Carlos III, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| | - Beatriz Olaya
- Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Instituto de Salud Carlos III, Spain.,Research, Innovation and Teaching Unit, Parc Sanitari Sant Joan de Déu, Sant Boi de Llobregat, Barcelona, Spain
| | - Josep Maria Haro
- Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Instituto de Salud Carlos III, Spain.,Research, Innovation and Teaching Unit, Parc Sanitari Sant Joan de Déu, Sant Boi de Llobregat, Barcelona, Spain.,Department of Psychiatry and Clinical Psychobiology, Universitat de Barcelona, Barcelona, Spain
| | - Marta Miret
- Department of Psychiatry, Universidad Autónoma de Madrid, Spain.,Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Instituto de Salud Carlos III, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| | - José Luis Ayuso-Mateos
- Department of Psychiatry, Universidad Autónoma de Madrid, Spain.,Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Instituto de Salud Carlos III, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| |
Collapse
|
6
|
Wu W, He K, Shi X, Schaubel DE, Kalbfleisch JD. Analysis of hospital readmissions with competing risks. Stat Methods Med Res 2022; 31:2189-2200. [PMID: 35899312 PMCID: PMC9931495 DOI: 10.1177/09622802221115879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The 30-day hospital readmission rate has been used in provider profiling for evaluating inter-provider care coordination, medical cost effectiveness, and patient quality of life. Current profiling analyzes use logistic regression to model 30-day readmission as a binary outcome, but one disadvantage of this approach is that this outcome is strongly affected by competing risks (e.g., death). Thus, one, perhaps unintended, consequence is that if two facilities have the same rates of readmission, the one with the higher rate of competing risks will have the lower 30-day readmission rate. We propose a discrete time competing risk model wherein the cause-specific readmission hazard is used to assess provider-level effects. This approach takes account of the timing of events and focuses on the readmission rates which are of primary interest. The quality measure, then is a standardized readmission ratio, akin to a standardized mortality ratio. This measure is not systematically affected by the rate of competing risks. To facilitate the estimation and inference of a large number of provider effects, we develop an efficient Blockwise Inversion Newton algorithm, and a stabilized robust score test that overcomes the conservative nature of the classical robust score test. An application to dialysis patients demonstrates improved profiling, model fitting, and outlier detection over existing methods.
Collapse
Affiliation(s)
- Wenbo Wu
- Department of Biostatistics and Kidney Epidemiology and Cost Center, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Kevin He
- Department of Biostatistics and Kidney Epidemiology and Cost Center, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Xu Shi
- Department of Biostatistics and Kidney Epidemiology and Cost Center, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Douglas E Schaubel
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - John D Kalbfleisch
- Department of Biostatistics and Kidney Epidemiology and Cost Center, University of Michigan School of Public Health, Ann Arbor, MI, USA
| |
Collapse
|
7
|
Ge X, Peng Y, Tu D. A generalized single‐index linear threshold model for identifying treatment‐sensitive subsets based on multiple covariates and longitudinal measurements. CAN J STAT 2022. [DOI: 10.1002/cjs.11737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Xinyi Ge
- Department of Mathematics and Statistics Queen's University Kingston Ontario Canada
| | - Yingwei Peng
- Departments of Mathematics and Statistics & Public Health Sciences Queen's University Kingston Ontario Canada
| | - Dongsheng Tu
- Departments of Mathematics and Statistics & Public Health Sciences and Canadian Cancer Trials Group Queen's University Kingston Ontario Canada
| |
Collapse
|
8
|
Monchatre-Leroy E, Sauvage F, Boué F, Augot D, Marianneau P, Hénaux V, Crespin L. Prevalence and Incidence of Puumala Orthohantavirus in its Bank Vole (Myodes glareolus) Host Population in Northeastern France: Between-site and Seasonal Variability. Epidemics 2022; 40:100600. [DOI: 10.1016/j.epidem.2022.100600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Revised: 05/02/2022] [Accepted: 06/14/2022] [Indexed: 11/03/2022] Open
|
9
|
Hui FK. GEE-assisted Forward Regression for Spatial Latent Variable Models. J Comput Graph Stat 2022. [DOI: 10.1080/10618600.2022.2058002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Francis K.C. Hui
- Research School of Finance, Actuarial Studies and Statistics, The Australian National University, Canberra, Australia
| |
Collapse
|
10
|
Hu S, Wang YG, Fu L. Performance of variance estimators in the analysis of longitudinal data with a large cluster size. J STAT COMPUT SIM 2022. [DOI: 10.1080/00949655.2021.1929983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Shuwen Hu
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
| | - You-Gan Wang
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
| | - Liya Fu
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, People's Republic of China
| |
Collapse
|
11
|
Hui FKC, Müller S, Welsh AH. GEE-Assisted Variable Selection for Latent Variable Models with Multivariate Binary Data. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2021.1987251] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Francis K. C. Hui
- Research School of Finance, Actuarial Studies & Statistics, Australian National University, Canberra, Australia
| | - Samuel Müller
- School of Mathematics and Statistics, University of Sydney, Sydney, Australia
| | - A. H. Welsh
- Research School of Finance, Actuarial Studies & Statistics, Australian National University, Canberra, Australia
| |
Collapse
|
12
|
Im S. Performance of the Beta-Binomial Model for Clustered Binary Responses: Comparison with Generalized Estimating Equations. JOURNAL OF MODERN APPLIED STATISTICAL METHODS 2021. [DOI: 10.22237/jmasm/1619482380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
This study examined performance of the beta-binomial model in comparison with GEE using clustered binary responses resulting in non-normal outcomes. Monte Carlo simulations were performed under varying intracluster correlations and sample sizes. The results showed that the beta-binomial model performed better for small sample, while GEE performed well under large sample.
Collapse
|
13
|
Assessment of a Modified Sandwich Estimator for Generalized Estimating Equations with Application to Opioid Poisoning in MIMIC-IV ICU Patients. STATS 2021. [DOI: 10.3390/stats4030039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Longitudinal data is encountered frequently in many healthcare research areas to include the critical care environment. Repeated measures from the same subject are expected to correlate with each other. Models with binary outcomes are commonly used in this setting. Regression models for correlated binary outcomes are frequently fit using generalized estimating equations (GEE). The Liang and Zeger sandwich estimator is often used in GEE to produce unbiased standard error estimation for regression coefficients in large sample settings, even when the covariance structure is misspecified. The sandwich estimator performs optimally in balanced designs when the number of participants is large with few repeated measurements. The sandwich estimator’s asymptotic properties do not hold in small sample and rare-event settings. Under these conditions, the sandwich estimator underestimates the variances and is biased downwards. Here, the performance of a modified sandwich estimator is compared to the traditional Liang-Zeger estimator and alternative forms proposed by authors Morel, Pan, and Mancl-DeRouen. Each estimator’s performance was assessed with 95% coverage probabilities for the regression coefficients using simulated data under various combinations of sample sizes and outcome prevalence values with independence and autoregressive correlation structures. This research was motivated by investigations involving rare-event outcomes in intensive care unit settings.
Collapse
|
14
|
Luepsen H. ANOVA with binary variables: the F-test and some alternatives. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2020.1869983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Haiko Luepsen
- Regional Computing Centre, University of Cologne, Cologne, Germany
| |
Collapse
|
15
|
Li F, Tong G. Sample size estimation for modified Poisson analysis of cluster randomized trials with a binary outcome. Stat Methods Med Res 2021; 30:1288-1305. [PMID: 33826454 PMCID: PMC9132618 DOI: 10.1177/0962280221990415] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
The modified Poisson regression coupled with a robust sandwich variance has become a viable alternative to log-binomial regression for estimating the marginal relative risk in cluster randomized trials. However, a corresponding sample size formula for relative risk regression via the modified Poisson model is currently not available for cluster randomized trials. Through analytical derivations, we show that there is no loss of asymptotic efficiency for estimating the marginal relative risk via the modified Poisson regression relative to the log-binomial regression. This finding holds both under the independence working correlation and under the exchangeable working correlation provided a simple modification is used to obtain the consistent intraclass correlation coefficient estimate. Therefore, the sample size formulas developed for log-binomial regression naturally apply to the modified Poisson regression in cluster randomized trials. We further extend the sample size formulas to accommodate variable cluster sizes. An extensive Monte Carlo simulation study is carried out to validate the proposed formulas. We find that the proposed formulas have satisfactory performance across a range of cluster size variability, as long as suitable finite-sample corrections are applied to the sandwich variance estimator and the number of clusters is at least 10. Our findings also suggest that the sample size estimate under the exchangeable working correlation is more robust to cluster size variability, and recommend the use of an exchangeable working correlation over an independence working correlation for both design and analysis. The proposed sample size formulas are illustrated using the Stop Colorectal Cancer (STOP CRC) trial.
Collapse
Affiliation(s)
- Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Methods in Implementation and Preventive Science, Yale University, New Haven, CT, USA
- Yale Center for Analytical Sciences, Yale University, New Haven, CT, USA
| | - Guangyu Tong
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Yale Center for Analytical Sciences, Yale University, New Haven, CT, USA
| |
Collapse
|
16
|
Gosho M, Noma H, Maruo K. Practical Review and Comparison of Modified Covariance Estimators for Linear Mixed Models in Small‐sample Longitudinal Studies with Missing Data. Int Stat Rev 2021. [DOI: 10.1111/insr.12447] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Masahiko Gosho
- Department of Biostatistics, Faculty of Medicine University of Tsukuba 1‐1‐1 Tennodai Tsukuba Ibaraki 305‐8575 Japan
| | - Hisashi Noma
- Department of Data Science The Institute of Statistical Mathematics 10‐3 Midori‐cho Tachikawa Tokyo 190‐8562 Japan
| | - Kazushi Maruo
- Department of Biostatistics, Faculty of Medicine University of Tsukuba 1‐1‐1 Tennodai Tsukuba Ibaraki 305‐8575 Japan
| |
Collapse
|
17
|
Zhang Q, Yi GY. Marginal analysis of bivariate mixed responses with measurement error and misclassification. Stat Methods Med Res 2021; 30:1155-1186. [PMID: 33635738 DOI: 10.1177/0962280220983587] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Bivariate responses with mixed continuous and binary variables arise commonly in applications such as clinical trials and genetic studies. Statistical methods based on jointly modeling continuous and binary variables have been available. However, such methods ignore the effects of response mismeasurement, a ubiquitous feature in applications. It has been well studied that in many settings, ignorance of mismeasurement in variables usually results in biased estimation. In this paper, we consider the setting with a bivariate outcome vector which contains a continuous component and a binary component both subject to mismeasurement. We propose estimating equation approaches to handle measurement error in the continuous response and misclassification in the binary response simultaneously. The proposed estimators are consistent and robust to certain model misspecification, provided regularity conditions. Extensive simulation studies confirm that the proposed methods successfully correct the biases resulting from the error-in-variables under various settings. The proposed methods are applied to analyze the outbred Carworth Farms White mice data arising from a genome-wide association study.
Collapse
Affiliation(s)
- Qihuang Zhang
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada
| | - Grace Y Yi
- Department of Statistics and Actuarial Sciences and Department of Computer Science, University of Western Ontario, London, ON, Canada
| |
Collapse
|
18
|
Thompson JA, Hemming K, Forbes A, Fielding K, Hayes R. Comparison of small-sample standard-error corrections for generalised estimating equations in stepped wedge cluster randomised trials with a binary outcome: A simulation study. Stat Methods Med Res 2021; 30:425-439. [PMID: 32970526 PMCID: PMC8008420 DOI: 10.1177/0962280220958735] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Generalised estimating equations with the sandwich standard-error estimator provide a promising method of analysis for stepped wedge cluster randomised trials. However, they have inflated type-one error when used with a small number of clusters, which is common for stepped wedge cluster randomised trials. We present a large simulation study of binary outcomes comparing bias-corrected standard errors from Fay and Graubard; Mancl and DeRouen; Kauermann and Carroll; Morel, Bokossa, and Neerchal; and Mackinnon and White with an independent and exchangeable working correlation matrix. We constructed 95% confidence intervals using a t-distribution with degrees of freedom including clusters minus parameters (DFC-P), cluster periods minus parameters, and estimators from Fay and Graubard (DFFG), and Pan and Wall. Fay and Graubard and an approximation to Kauermann and Carroll (with simpler matrix inversion) were unbiased in a wide range of scenarios with an independent working correlation matrix and more than 12 clusters. They gave confidence intervals with close to 95% coverage with DFFG with 12 or more clusters, and DFC-P with 18 or more clusters. Both standard errors were conservative with fewer clusters. With an exchangeable working correlation matrix, approximated Kauermann and Carroll and Fay and Graubard had a small degree of under-coverage.
Collapse
Affiliation(s)
- JA Thompson
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - K Hemming
- Institute of Applied Health Research, University of Birmingham, Birmingham, UK
| | - A Forbes
- Biostatistics Unit, Monash University, Melbourne, Australia
| | - K Fielding
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - R Hayes
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| |
Collapse
|
19
|
Kruppa J, Hothorn L. A comparison study on modeling of clustered and overdispersed count data for multiple comparisons. J Appl Stat 2020; 48:3220-3232. [PMID: 35707260 PMCID: PMC9042126 DOI: 10.1080/02664763.2020.1788518] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Data collected in various scientific fields are count data. One way to analyze such data is to compare the individual levels of the factor treatment using multiple comparisons. However, the measured individuals are often clustered - e.g. according to litter or rearing. This must be considered when estimating the parameters by a repeated measurement model. In addition, ignoring the overdispersion to which count data is prone leads to an increase of the type one error rate. We carry out simulation studies using several different data settings and compare different multiple contrast tests with parameter estimates from generalized estimation equations and generalized linear mixed models in order to observe coverage and rejection probabilities. We generate overdispersed, clustered count data in small samples as can be observed in many biological settings. We have found that the generalized estimation equations outperform generalized linear mixed models if the variance-sandwich estimator is correctly specified. Furthermore, generalized linear mixed models show problems with the convergence rate under certain data settings, but there are model implementations with lower implications exists. Finally, we use an example of genetic data to demonstrate the application of the multiple contrast test and the problems of ignoring strong overdispersion.
Collapse
Affiliation(s)
- Jochen Kruppa
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Institute of Biometry and Clinical Epidemiology, Berlin, Germany.,Berlin Institute of Health (BIH), Berlin, Germany
| | - Ludwig Hothorn
- Institute of Biostatistics, Leibniz University Hannover, Germany, Hannover, Germany
| |
Collapse
|
20
|
Martín-María N, Caballero FF, Moreno-Agostino D, Olaya B, Haro JM, Ayuso-Mateos JL, Miret M. Relationship between subjective well-being and healthy lifestyle behaviours in older adults: a longitudinal study. Aging Ment Health 2020; 24:611-619. [PMID: 30590962 DOI: 10.1080/13607863.2018.1548567] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Objectives: People who report better subjective well-being tend to be healthier in their daily behaviours. The objective of this study is to assess whether different components of subjective well-being are prospectively associated with different healthy lifestyle behaviours and to assess whether these associations differ by age.Method: A total of 1,892 participants aged 50+ living in Spain were interviewed in 2011-12 and 2014-15. Life satisfaction was measured with the Cantril Self-Anchoring Striving Scale. Positive and negative affect were assessed using the Day Reconstruction Method. Physical activity was assessed with the Global Physical Activity Questionnaire version 2. The remaining healthy lifestyle behaviours were self-reported. Generalised Estimating Equations (GEE) models were run.Results: Not having a heavy episodic alcohol drinking was the healthy lifestyle behaviour most fulfilled (97.97%), whereas the intake of five or more fruits and vegetables was the least followed (33.12%). GEE models conducted over the 50-64 and the 65+ age groups showed that a higher life satisfaction was significantly related to a higher physical activity in both groups. Relationships between a higher negative affect and presenting a lower level of physical activity, and a higher positive affect and following the right consumption of fruits and vegetables and being a non-daily smoker, were only found in the older group.Conclusion: The relationship between subjective well-being and healthy lifestyle behaviours was found fundamentally in those aged 65+ years. Interventions focused on incrementing subjective well-being would have an impact on keeping a healthy lifestyle and, therefore, on reducing morbidity and mortality.
Collapse
Affiliation(s)
- Natalia Martín-María
- Department of Psychiatry, Universidad Autónoma de Madrid, Madrid, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Madrid, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| | - Francisco Félix Caballero
- Department of Preventive Medicine, Public Health and Microbiology, Universidad Autónoma de Madrid, Madrid, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red de Epidemiología y Salud Pública. CIBERESP, Madrid, Spain
| | - Darío Moreno-Agostino
- Department of Psychiatry, Universidad Autónoma de Madrid, Madrid, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Madrid, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| | - Beatriz Olaya
- Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Madrid, Spain.,Parc Sanitari Sant Joan de Déu, Universitat de Barcelona, Sant Boi de Llobregat, Barcelona, Spain
| | - Josep Maria Haro
- Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Madrid, Spain.,Parc Sanitari Sant Joan de Déu, Universitat de Barcelona, Sant Boi de Llobregat, Barcelona, Spain
| | - José Luis Ayuso-Mateos
- Department of Psychiatry, Universidad Autónoma de Madrid, Madrid, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Madrid, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| | - Marta Miret
- Department of Psychiatry, Universidad Autónoma de Madrid, Madrid, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red de Salud Mental. CIBERSAM, Madrid, Spain.,Department of Psychiatry, Hospital Universitario de La Princesa, Instituto de Investigación Sanitaria Princesa (IIS-Princesa), Madrid, Spain
| |
Collapse
|
21
|
Berk M, Mohebbi M, Dean OM, Cotton SM, Chanen AM, Dodd S, Ratheesh A, Amminger GP, Phelan M, Weller A, Mackinnon A, Giorlando F, Baird S, Incerti L, Brodie RE, Ferguson NO, Rice S, Schäfer MR, Mullen E, Hetrick S, Kerr M, Harrigan SM, Quinn AL, Mazza C, McGorry P, Davey CG. Youth Depression Alleviation with Anti-inflammatory Agents (YoDA-A): a randomised clinical trial of rosuvastatin and aspirin. BMC Med 2020; 18:16. [PMID: 31948461 PMCID: PMC6966789 DOI: 10.1186/s12916-019-1475-6] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/04/2019] [Accepted: 11/27/2019] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Inflammation contributes to the pathophysiology of major depressive disorder (MDD), and anti-inflammatory strategies might therefore have therapeutic potential. This trial aimed to determine whether adjunctive aspirin or rosuvastatin, compared with placebo, reduced depressive symptoms in young people (15-25 years). METHODS YoDA-A, Youth Depression Alleviation with Anti-inflammatory Agents, was a 12-week triple-blind, randomised, controlled trial. Participants were young people (aged 15-25 years) with moderate to severe MDD (MADRS mean at baseline 32.5 ± 6.0; N = 130; age 20.2 ± 2.6; 60% female), recruited between June 2013 and June 2017 across six sites in Victoria, Australia. In addition to treatment as usual, participants were randomised to receive aspirin (n = 40), rosuvastatin (n = 48), or placebo (n = 42), with assessments at baseline and weeks 4, 8, 12, and 26. The primary outcome was change in the Montgomery-Åsberg Depression Rating Scale (MADRS) from baseline to week 12. RESULTS At the a priori primary endpoint of MADRS differential change from baseline at week 12, there was no significant difference between aspirin and placebo (1.9, 95% CI (- 2.8, 6.6), p = 0.433), or rosuvastatin and placebo (- 4.2, 95% CI (- 9.1, 0.6), p = 0.089). For rosuvastatin, secondary outcomes on self-rated depression and global impression, quality of life, functioning, and mania were not significantly different from placebo. Aspirin was inferior to placebo on the Quality of Life Enjoyment and Satisfaction Questionnaire (Q-LES-Q-SF) at week 12. Statins were superior to aspirin on the MADRS, the Clinical Global Impressions Severity Scale (CGI-S), and the Negative Problem Orientation Questionnaire scale (NPOQ) at week 12. CONCLUSIONS The addition of either aspirin or rosuvastatin did not to confer any beneficial effect over and above routine treatment for depression in young people. Exploratory comparisons of secondary outcomes provide limited support for a potential therapeutic role for adjunctive rosuvastatin, but not for aspirin, in youth depression. TRIAL REGISTRATION Australian New Zealand Clinical Trials Registry, ACTRN12613000112763. Registered on 30/01/2013.
Collapse
Affiliation(s)
- Michael Berk
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia. .,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia. .,The Institute for Mental and Physical Health and Clinical Translation, Deakin University, Geelong, Australia. .,Florey Institute for Neuroscience and Mental Health, University of Melbourne, Parkville, Australia. .,Department of Psychiatry, University of Melbourne, Parkville, Australia. .,Barwon Health, PO Box 281, Geelong, Victoria, 3220, Australia.
| | - Mohammadreza Mohebbi
- The Institute for Mental and Physical Health and Clinical Translation, Deakin University, Geelong, Australia.,Biostatistics Unit, Faculty of Health, Deakin University, Geelong, Australia
| | - Olivia M Dean
- The Institute for Mental and Physical Health and Clinical Translation, Deakin University, Geelong, Australia.,Florey Institute for Neuroscience and Mental Health, University of Melbourne, Parkville, Australia.,Barwon Health, PO Box 281, Geelong, Victoria, 3220, Australia
| | - Sue M Cotton
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Andrew M Chanen
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia.,Orygen Youth Health, Northwestern Mental Health, Melbourne, Australia
| | - Seetal Dodd
- Centre for Youth Mental Health, University of Melbourne, Parkville, Australia.,The Institute for Mental and Physical Health and Clinical Translation, Deakin University, Geelong, Australia.,Department of Psychiatry, University of Melbourne, Parkville, Australia.,Barwon Health, PO Box 281, Geelong, Victoria, 3220, Australia
| | - Aswin Ratheesh
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia.,Orygen Youth Health, Northwestern Mental Health, Melbourne, Australia
| | - G Paul Amminger
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Mark Phelan
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia.,Orygen Youth Health, Northwestern Mental Health, Melbourne, Australia
| | - Amber Weller
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Andrew Mackinnon
- Black Dog Institute, University of New South Wales, Sydney, Australia.,Melbourne School of Population and Global Health, University of Melbourne, Melbourne, Australia
| | - Francesco Giorlando
- Department of Psychiatry, University of Melbourne, Parkville, Australia.,Orygen Youth Health, Northwestern Mental Health, Melbourne, Australia
| | - Shelley Baird
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Lisa Incerti
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Rachel E Brodie
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Natalie O Ferguson
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Simon Rice
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia.,Orygen Youth Health, Northwestern Mental Health, Melbourne, Australia
| | - Miriam R Schäfer
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Edward Mullen
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia.,Orygen Youth Health, Northwestern Mental Health, Melbourne, Australia
| | - Sarah Hetrick
- Centre for Youth Mental Health, University of Melbourne, Parkville, Australia.,Department of Psychological Medicine, University of Auckland, Auckland, New Zealand
| | - Melissa Kerr
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Susy M Harrigan
- Melbourne School of Population and Global Health, University of Melbourne, Melbourne, Australia.,Department of Social Work, Monash University, Melbourne, Australia
| | - Amelia L Quinn
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Catherine Mazza
- The Institute for Mental and Physical Health and Clinical Translation, Deakin University, Geelong, Australia
| | - Patrick McGorry
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia
| | - Christopher G Davey
- Orygen, the National Centre of Excellence in Youth Mental Health, Melbourne, Australia.,Centre for Youth Mental Health, University of Melbourne, Parkville, Australia.,Orygen Youth Health, Northwestern Mental Health, Melbourne, Australia
| |
Collapse
|
22
|
McNeish D. Effect Partitioning in Cross-Sectionally Clustered Data Without Multilevel Models. MULTIVARIATE BEHAVIORAL RESEARCH 2019; 54:906-925. [PMID: 31021178 DOI: 10.1080/00273171.2019.1602504] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Effect partitioning is almost exclusively performed with multilevel models (MLMs) - so much so that some have considered the two to be synonymous. MLMs are able to provide estimates with desirable statistical properties when data come from a hierarchical structure; but the random effects included in MLMs are not always integral to the analysis. As a result, other methods with relaxed assumptions are viable options in many cases. Through empirical examples and simulations, we show how generalized estimating equations (GEEs) can be used to effectively partition effects without random effects. We show that more onerous steps of MLMs such as determining the number of random effects and the structure for their covariance can be bypassed with GEEs while still obtaining identical or near-identical results. Additionally, violations of distributional assumptions adversely affect estimates with MLMs but have no effect on GEEs because no such assumptions are made. This makes GEEs a flexible alternative to MLMs with minimal assumptions that may warrant consideration. Limitations of GEEs for partitioning effects are also discussed.
Collapse
|
23
|
Kripalani S, Chen G, Ciampa P, Theobald C, Cao A, McBride M, Dittus RS, Speroff T. A transition care coordinator model reduces hospital readmissions and costs. Contemp Clin Trials 2019; 81:55-61. [PMID: 31029692 DOI: 10.1016/j.cct.2019.04.014] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Revised: 04/01/2019] [Accepted: 04/24/2019] [Indexed: 11/26/2022]
Abstract
BACKGROUND The optimal structure and intensity of interventions to reduce hospital readmission remains uncertain, due in part to lack of head-to-head comparison. To address this gap, we evaluated two forms of an evidence-based, multi-component transitional care intervention. METHODS A quasi-experimental evaluation design compared outcomes of Transition Care Coordinator (TCC) Care to Usual Care, while controlling for sociodemographic characteristics, comorbidities, readmission risk, and administrative factors. The study was conducted between January 1, 2013 and April 30, 2015 as a quality improvement initiative. Eligible adults (N = 7038) hospitalized with pneumonia, congestive heart failure, or chronic obstructive pulmonary disease were identified for program evaluation via an electronic health record algorithm. Nurse TCCs provided either a full intervention (delivered in-hospital and by post-discharge phone call) or a partial intervention (phone call only). RESULTS A total of 762 hospitalizations with TCC Care (460 full intervention and 302 partial intervention) and 6276 with Usual Care was examined. In multivariable models, hospitalizations with TCC Care had significantly lower odds of readmission at 30 days (OR = 0.512, 95% CI 0.392 to 0.668) and 90 days (OR = 0.591, 95% CI 0.483 to 0.723). Adjusted costs were significantly lower at 30 days (difference = $3969, 95% CI $5099 to $2691) and 90 days (difference = $5684, 95% CI $7602 to $3627). The effect was similar whether patients received the full or partial intervention. CONCLUSION An evidence-based multi-component intervention delivered by nurse TCCs reduced 30- and 90-day readmissions and associated health care costs. Lower intensity interventions delivered by telephone after discharge may have similar effectiveness to in-hospital programs.
Collapse
Affiliation(s)
- Sunil Kripalani
- Division of General Internal Medicine and Public Health, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Center for Clinical Quality and Implementation Research, Vanderbilt University Medical Center, USA; Center for Health Services Research, Vanderbilt University Medical Center, USA.
| | - Guanhua Chen
- Department of Biostatistics & Medical Informatics, University of Wisconsin - Madison, Madison, WI, USA
| | - Philip Ciampa
- Atrius Health, Center for Healthcare Innovation, Newton, MA, USA
| | - Cecelia Theobald
- Division of General Internal Medicine and Public Health, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Center for Clinical Quality and Implementation Research, Vanderbilt University Medical Center, USA
| | - Aize Cao
- Department of Biomedical Informatics, Vanderbilt University Medical Center, USA
| | - Megan McBride
- Office of Population Health, Vanderbilt University Medical Center, USA
| | - Robert S Dittus
- Division of General Internal Medicine and Public Health, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Center for Clinical Quality and Implementation Research, Vanderbilt University Medical Center, USA; Center for Health Services Research, Vanderbilt University Medical Center, USA; Department of Veterans Affairs, Valley Healthcare System Geriatric Research Education and Clinical Center (GRECC), TN, USA
| | - Theodore Speroff
- Division of General Internal Medicine and Public Health, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Center for Clinical Quality and Implementation Research, Vanderbilt University Medical Center, USA; Center for Health Services Research, Vanderbilt University Medical Center, USA; Department of Biostatistics & Medical Informatics, University of Wisconsin - Madison, Madison, WI, USA; Department of Veterans Affairs, Valley Healthcare System Geriatric Research Education and Clinical Center (GRECC), TN, USA
| |
Collapse
|
24
|
Mondol MH, Rahman MS. Bias-reduced and separation-proof GEE with small or sparse longitudinal binary data. Stat Med 2019; 38:2544-2560. [PMID: 30793784 DOI: 10.1002/sim.8126] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2017] [Revised: 11/08/2018] [Accepted: 01/22/2019] [Indexed: 11/10/2022]
Abstract
Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra-cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth-type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small-sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi-complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near-to-separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias-corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias-corrected sandwich covariance estimator is recommended to use for small-to-moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near-to-separation.
Collapse
Affiliation(s)
| | - M Shafiqur Rahman
- Institute of Statistical Research and Training, University of Dhaka, Dhaka, Bangladesh
| |
Collapse
|
25
|
|
26
|
Westgate PM, Burchett WW. On the analysis of very small samples of Gaussian repeated measurements: an alternative approach. Stat Med 2017; 36:958-970. [PMID: 28064473 PMCID: PMC5291809 DOI: 10.1002/sim.7199] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Revised: 10/13/2016] [Accepted: 11/21/2016] [Indexed: 11/06/2022]
Abstract
The analysis of very small samples of Gaussian repeated measurements can be challenging. First, due to a very small number of independent subjects contributing outcomes over time, statistical power can be quite small. Second, nuisance covariance parameters must be appropriately accounted for in the analysis in order to maintain the nominal test size. However, available statistical strategies that ensure valid statistical inference may lack power, whereas more powerful methods may have the potential for inflated test sizes. Therefore, we explore an alternative approach to the analysis of very small samples of Gaussian repeated measurements, with the goal of maintaining valid inference while also improving statistical power relative to other valid methods. This approach uses generalized estimating equations with a bias-corrected empirical covariance matrix that accounts for all small-sample aspects of nuisance correlation parameter estimation in order to maintain valid inference. Furthermore, the approach utilizes correlation selection strategies with the goal of choosing the working structure that will result in the greatest power. In our study, we show that when accurate modeling of the nuisance correlation structure impacts the efficiency of regression parameter estimation, this method can improve power relative to existing methods that yield valid inference. Copyright © 2017 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Philip M Westgate
- Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, 40536, KY, U.S.A
| | - Woodrow W Burchett
- Department of Statistics, College of Arts and Sciences, University of Kentucky, Lexington, 40536, KY, U.S.A
| |
Collapse
|
27
|
Yang W, Liao S. A study of quadratic inference functions with alternative weighting matrices. COMMUN STAT-SIMUL C 2017. [DOI: 10.1080/03610918.2014.988255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Weiming Yang
- College of Mathematics and Statistics, Chongqing Technology and Business University, Chongqing, China
| | - Shu Liao
- College of Mathematics and Statistics, Chongqing Technology and Business University, Chongqing, China
| |
Collapse
|
28
|
|
29
|
Component, group and demographic Allee effects in a cooperatively breeding bird species, the Arabian babbler (Turdoides squamiceps). Oecologia 2016; 182:153-61. [DOI: 10.1007/s00442-016-3656-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2015] [Accepted: 05/11/2016] [Indexed: 10/21/2022]
|
30
|
Huang S, Fiero MH, Bell ML. Generalized estimating equations in cluster randomized trials with a small number of clusters: Review of practice and simulation study. Clin Trials 2016; 13:445-9. [DOI: 10.1177/1740774516643498] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Background/aims: Generalized estimating equations are a common modeling approach used in cluster randomized trials to account for within-cluster correlation. It is well known that the sandwich variance estimator is biased when the number of clusters is small (≤40), resulting in an inflated type I error rate. Various bias correction methods have been proposed in the statistical literature, but how adequately they are utilized in current practice for cluster randomized trials is not clear. The aim of this study is to evaluate the use of generalized estimating equation bias correction methods in recently published cluster randomized trials and demonstrate the necessity of such methods when the number of clusters is small. Methods: Review of cluster randomized trials published between August 2013 and July 2014 and using generalized estimating equations for their primary analyses. Two independent reviewers collected data from each study using a standardized, pre-piloted data extraction template. A two-arm cluster randomized trial was simulated under various scenarios to show the potential effect of a small number of clusters on type I error rate when estimating the treatment effect. The nominal level was set at 0.05 for the simulation study. Results: Of the 51 included trials, 28 (54.9%) analyzed 40 or fewer clusters with a minimum of four total clusters. Of these 28 trials, only one trial used a bias correction method for generalized estimating equations. The simulation study showed that with four clusters, the type I error rate ranged between 0.43 and 0.47. Even though type I error rate moved closer to the nominal level as the number of clusters increases, it still ranged between 0.06 and 0.07 with 40 clusters. Conclusions: Our results showed that statistical issues arising from small number of clusters in generalized estimating equations is currently inadequately handled in cluster randomized trials. Potential for type I error inflation could be very high when the sandwich estimator is used without bias correction.
Collapse
Affiliation(s)
- Shuang Huang
- Departments of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ, USA
| | - Mallorie H Fiero
- Departments of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ, USA
| | - Melanie L Bell
- Departments of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
31
|
Westgate PM, Burchett WW. Improving power in small-sample longitudinal studies when using generalized estimating equations. Stat Med 2016; 35:3733-44. [PMID: 27090375 DOI: 10.1002/sim.6967] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Revised: 03/22/2016] [Accepted: 03/23/2016] [Indexed: 11/06/2022]
Abstract
Generalized estimating equations (GEE) are often used for the marginal analysis of longitudinal data. Although much work has been performed to improve the validity of GEE for the analysis of data arising from small-sample studies, little attention has been given to power in such settings. Therefore, we propose a valid GEE approach to improve power in small-sample longitudinal study settings in which the temporal spacing of outcomes is the same for each subject. Specifically, we use a modified empirical sandwich covariance matrix estimator within correlation structure selection criteria and test statistics. Use of this estimator can improve the accuracy of selection criteria and increase the degrees of freedom to be used for inference. The resulting impacts on power are demonstrated via a simulation study and application example. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Philip M Westgate
- Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, 40536, KY, U.S.A
| | - Woodrow W Burchett
- Department of Statistics, College of Arts and Sciences, University of Kentucky, Lexington, 40536, KY, U.S.A
| |
Collapse
|
32
|
Singh KK, Qin M, Brummel SS, Angelidou K, Trout RN, Fenton T, Spector SA. Killer Cell Immunoglobulin-Like Receptor Alleles Alter HIV Disease in Children. PLoS One 2016; 11:e0151364. [PMID: 26983081 PMCID: PMC4794224 DOI: 10.1371/journal.pone.0151364] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2015] [Accepted: 02/27/2016] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND HLA class I molecules are ligands for killer cell immunoglobin like receptors (KIR) that control the antiviral response of natural killer (NK) cells. However, the effects of KIR and HLA (KIR/HLA) alleles on HIV disease of children have not been studied. METHODS 993 antiretroviral naïve children with symptomatic HIV infection from PACTG protocols P152 and P300 were genotyped for KIR and HLA alleles using the Luminex platform. Linear regression was used to test the association between genotypes and baseline pre-ART HIV RNA, CD4+ lymphocyte count, and cognitive score, adjusting for age, race/ethnicity and study. The interaction between genetic markers and age was investigated. To account for multiple testing the false discovery rate (FDR) was controlled at 0.05. RESULTS Children with the KIR2DS4*ALL FULL LENGTH (KIR2DS4*AFL) allele had higher CD4+ lymphocyte counts. Among children ≤2 years of age, the KIR2DS4*AFL was associated with lower plasma HIV RNA and higher cognitive index scores. KIR Cent2DS3/5_1 had lower CD4+ lymphocyte counts in children ≤2 years of age, while the presence of Tel1, Tel2DS4_2, Tel2DS4_4, Tel8, Tel2DS4_6 had higher CD4+ lymphocyte counts in all children. Presence of Cent2, Cent4 and Cent8 was associated with increased HIV RNA load in children ≤2 years. Presence of KIR3DL1+Bw4 was associated with higher CD4+ lymphocyte counts in all children. Among children >2 years old, KIR3DS1+Bw4-80I was associated with higher plasma HIV RNA, and Bw6/Bw6 was associated with lower plasma HIV RNA compared to children with KIR3DS1+Bw4-80I. CONCLUSIONS Presented data show for the first time that specific KIR alleles independently or combined with HLA ligands are associated with HIV RNA and CD4+ lymphocyte counts in infected, antiretroviral naive children; and many of these effect estimates appear to be age dependent. These data support a role for specific KIR alleles in HIV pathogenesis in children.
Collapse
Affiliation(s)
- Kumud K. Singh
- Department of Pediatrics, University of California, San Diego, La Jolla, California, United States of America
| | - Min Qin
- Center for Biostatistics in AIDS Research, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Sean S. Brummel
- Center for Biostatistics in AIDS Research, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Konstantia Angelidou
- Center for Biostatistics in AIDS Research, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Rodney N. Trout
- Department of Pediatrics, University of California, San Diego, La Jolla, California, United States of America
| | - Terence Fenton
- Center for Biostatistics in AIDS Research, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Stephen A. Spector
- Department of Pediatrics, University of California, San Diego, La Jolla, California, United States of America
- Rady Children’s Hospital, San Diego, California, United States of America
| |
Collapse
|
33
|
Keynan O, Ridley AR, Lotem A. Task‐Dependent Differences in Learning by Subordinate and Dominant Wild Arabian Babblers. Ethology 2016. [DOI: 10.1111/eth.12488] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Oded Keynan
- Department of Zoology Faculty of Life Sciences Tel‐Aviv University Tel‐Aviv Israel
- Department of Biological Sciences Macquarie University Sydney NSW Australia
- Dead Sea & Arava Science Center Central Arava Israel
| | - Amanda R. Ridley
- Department of Biological Sciences Macquarie University Sydney NSW Australia
- Centre of Evolutionary Biology School of Animal Biology University of Western Australia Perth WA Australia
| | - Arnon Lotem
- Department of Zoology Faculty of Life Sciences Tel‐Aviv University Tel‐Aviv Israel
| |
Collapse
|
34
|
Genome-wide gene-environment interactions on quantitative traits using family data. Eur J Hum Genet 2015; 24:1022-8. [PMID: 26626313 DOI: 10.1038/ejhg.2015.253] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2015] [Revised: 10/09/2015] [Accepted: 10/27/2015] [Indexed: 12/15/2022] Open
Abstract
Gene-environment interactions may provide a mechanism for targeting interventions to those individuals who would gain the most benefit from them. Searching for interactions agnostically on a genome-wide scale requires large sample sizes, often achieved through collaboration among multiple studies in a consortium. Family studies can contribute to consortia, but to do so they must account for correlation within families by using specialized analytic methods. In this paper, we investigate the performance of methods that account for within-family correlation, in the context of gene-environment interactions with binary exposures and quantitative outcomes. We simulate both cross-sectional and longitudinal measurements, and analyze the simulated data taking family structure into account, via generalized estimating equations (GEE) and linear mixed-effects models. With sufficient exposure prevalence and correct model specification, all methods perform well. However, when models are misspecified, mixed modeling approaches have seriously inflated type I error rates. GEE methods with robust variance estimates are less sensitive to model misspecification; however, when exposures are infrequent, GEE methods require modifications to preserve type I error rate. We illustrate the practical use of these methods by evaluating gene-drug interactions on fasting glucose levels in data from the Framingham Heart Study, a cohort that includes related individuals.
Collapse
|
35
|
Wang M, Kong L, Li Z, Zhang L. Covariance estimators for generalized estimating equations (GEE) in longitudinal analysis with small samples. Stat Med 2015; 35:1706-21. [PMID: 26585756 DOI: 10.1002/sim.6817] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2015] [Revised: 09/09/2015] [Accepted: 10/28/2015] [Indexed: 11/07/2022]
Abstract
Generalized estimating equations (GEE) is a general statistical method to fit marginal models for longitudinal data in biomedical studies. The variance-covariance matrix of the regression parameter coefficients is usually estimated by a robust "sandwich" variance estimator, which does not perform satisfactorily when the sample size is small. To reduce the downward bias and improve the efficiency, several modified variance estimators have been proposed for bias-correction or efficiency improvement. In this paper, we provide a comprehensive review on recent developments of modified variance estimators and compare their small-sample performance theoretically and numerically through simulation and real data examples. In particular, Wald tests and t-tests based on different variance estimators are used for hypothesis testing, and the guideline on appropriate sample sizes for each estimator is provided for preserving type I error in general cases based on numerical results. Moreover, we develop a user-friendly R package "geesmv" incorporating all of these variance estimators for public usage in practice.
Collapse
Affiliation(s)
- Ming Wang
- Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, U.S.A
| | - Lan Kong
- Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, U.S.A
| | - Zheng Li
- Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, U.S.A
| | - Lijun Zhang
- Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, PA, U.S.A.,Institute for Personalized Medicine, Penn State College of Medicine, Hershey, PA, U.S.A
| |
Collapse
|
36
|
Sitlani CM, Rice KM, Lumley T, McKnight B, Cupples LA, Avery CL, Noordam R, Stricker BH, Whitsel EA, Psaty BM. Generalized estimating equations for genome-wide association studies using longitudinal phenotype data. Stat Med 2015; 34:118-30. [PMID: 25297442 PMCID: PMC4321952 DOI: 10.1002/sim.6323] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2013] [Revised: 06/04/2014] [Accepted: 09/18/2014] [Indexed: 01/08/2023]
Abstract
Many longitudinal cohort studies have both genome-wide measures of genetic variation and repeated measures of phenotypes and environmental exposures. Genome-wide association study analyses have typically used only cross-sectional data to evaluate quantitative phenotypes and binary traits. Incorporation of repeated measures may increase power to detect associations, but also requires specialized analysis methods. Here, we discuss one such method-generalized estimating equations (GEE)-in the contexts of analysis of main effects of rare genetic variants and analysis of gene-environment interactions. We illustrate the potential for increased power using GEE analyses instead of cross-sectional analyses. We also address challenges that arise, such as the need for small-sample corrections when the minor allele frequency of a genetic variant and/or the prevalence of an environmental exposure is low. To illustrate methods for detection of gene-drug interactions on a genome-wide scale, using repeated measures data, we conduct single-study analyses and meta-analyses across studies in three large cohort studies participating in the Cohorts for Heart and Aging Research in Genomic Epidemiology consortium-the Atherosclerosis Risk in Communities study, the Cardiovascular Health Study, and the Rotterdam Study.
Collapse
Affiliation(s)
| | - Kenneth M. Rice
- Department of Biostatistics, University of Washington, Seattle,
WA
| | - Thomas Lumley
- Department of Statistics, University of Auckland, Auckland,
NZ
| | - Barbara McKnight
- Department of Biostatistics, University of Washington, Seattle,
WA
| | | | - Christy L. Avery
- Department of Epidemiology, University of North Carolina at
Chapel Hill, Chapel Hill, NC
| | - Raymond Noordam
- Department of Internal Medicine, Erasmus Medical Center,
Rotterdam, NL
- Department of Epidemiology, Erasmus Medical Center, Rotterdam,
NL
| | | | - Eric A. Whitsel
- Departments of Epidemiology and Medicine, University of North
Carolina at Chapel Hill, Chapel Hill, NC
| | - Bruce M. Psaty
- Departments of Medicine, Epidemiology, and Health Services,
University of Washington, Seattle, WA
- Group Health Research Institute, Group Health Cooperative,
Seattle, WA
| |
Collapse
|
37
|
Generalized Estimating Equations in Longitudinal Data Analysis: A Review and Recent Developments. ACTA ACUST UNITED AC 2014. [DOI: 10.1155/2014/303728] [Citation(s) in RCA: 104] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Generalized Estimating Equation (GEE) is a marginal model popularly applied for longitudinal/clustered data analysis in clinical trials or biomedical studies. We provide a systematic review on GEE including basic concepts as well as several recent developments due to practical challenges in real applications. The topics including the selection of “working” correlation structure, sample size and power calculation, and the issue of informative cluster size are covered because these aspects play important roles in GEE utilization and its statistical inference. A brief summary and discussion of potential research interests regarding GEE are provided in the end.
Collapse
|
38
|
Li P, Redden DT. Small sample performance of bias-corrected sandwich estimators for cluster-randomized trials with binary outcomes. Stat Med 2014; 34:281-96. [PMID: 25345738 DOI: 10.1002/sim.6344] [Citation(s) in RCA: 118] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Accepted: 10/07/2014] [Indexed: 11/08/2022]
Abstract
The sandwich estimator in generalized estimating equations (GEE) approach underestimates the true variance in small samples and consequently results in inflated type I error rates in hypothesis testing. This fact limits the application of the GEE in cluster-randomized trials (CRTs) with few clusters. Under various CRT scenarios with correlated binary outcomes, we evaluate the small sample properties of the GEE Wald tests using bias-corrected sandwich estimators. Our results suggest that the GEE Wald z-test should be avoided in the analyses of CRTs with few clusters even when bias-corrected sandwich estimators are used. With t-distribution approximation, the Kauermann and Carroll (KC)-correction can keep the test size to nominal levels even when the number of clusters is as low as 10 and is robust to the moderate variation of the cluster sizes. However, in cases with large variations in cluster sizes, the Fay and Graubard (FG)-correction should be used instead. Furthermore, we derive a formula to calculate the power and minimum total number of clusters one needs using the t-test and KC-correction for the CRTs with binary outcomes. The power levels as predicted by the proposed formula agree well with the empirical powers from the simulations. The proposed methods are illustrated using real CRT data. We conclude that with appropriate control of type I error rates under small sample sizes, we recommend the use of GEE approach in CRTs with binary outcomes because of fewer assumptions and robustness to the misspecification of the covariance structure.
Collapse
Affiliation(s)
- Peng Li
- Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL 35294, U.S.A
| | | |
Collapse
|
39
|
Gunzler D, Tang W, Lu N, Wu P, Tu XM. A class of distribution-free models for longitudinal mediation analysis. PSYCHOMETRIKA 2014; 79:543-568. [PMID: 24271505 PMCID: PMC4825877 DOI: 10.1007/s11336-013-9355-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2012] [Indexed: 06/02/2023]
Abstract
Mediation analysis constitutes an important part of treatment study to identify the mechanisms by which an intervention achieves its effect. Structural equation model (SEM) is a popular framework for modeling such causal relationship. However, current methods impose various restrictions on the study designs and data distributions, limiting the utility of the information they provide in real study applications. In particular, in longitudinal studies missing data is commonly addressed under the assumption of missing at random (MAR), where current methods are unable to handle such missing data if parametric assumptions are violated.In this paper, we propose a new, robust approach to address the limitations of current SEM within the context of longitudinal mediation analysis by utilizing a class of functional response models (FRM). Being distribution-free, the FRM-based approach does not impose any parametric assumption on data distributions. In addition, by extending the inverse probability weighted (IPW) estimates to the current context, the FRM-based SEM provides valid inference for longitudinal mediation analysis under the two most popular missing data mechanisms; missing completely at random (MCAR) and missing at random (MAR). We illustrate the approach with both real and simulated data.
Collapse
Affiliation(s)
- D Gunzler
- Center for Health Care Research & Policy, Case Western Reserve University at MetroHealth Medical Center, 2500 MetroHealth Drive, Cleveland, OH, 44109-1998, USA,
| | | | | | | | | |
Collapse
|
40
|
Scott JM, deCamp A, Juraska M, Fay MP, Gilbert PB. Finite-sample corrected generalized estimating equation of population average treatment effects in stepped wedge cluster randomized trials. Stat Methods Med Res 2014; 26:583-597. [PMID: 25267551 DOI: 10.1177/0962280214552092] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Stepped wedge designs are increasingly commonplace and advantageous for cluster randomized trials when it is both unethical to assign placebo, and it is logistically difficult to allocate an intervention simultaneously to many clusters. We study marginal mean models fit with generalized estimating equations for assessing treatment effectiveness in stepped wedge cluster randomized trials. This approach has advantages over the more commonly used mixed models that (1) the population-average parameters have an important interpretation for public health applications and (2) they avoid untestable assumptions on latent variable distributions and avoid parametric assumptions about error distributions, therefore, providing more robust evidence on treatment effects. However, cluster randomized trials typically have a small number of clusters, rendering the standard generalized estimating equation sandwich variance estimator biased and highly variable and hence yielding incorrect inferences. We study the usual asymptotic generalized estimating equation inferences (i.e., using sandwich variance estimators and asymptotic normality) and four small-sample corrections to generalized estimating equation for stepped wedge cluster randomized trials and for parallel cluster randomized trials as a comparison. We show by simulation that the small-sample corrections provide improvement, with one correction appearing to provide at least nominal coverage even with only 10 clusters per group. These results demonstrate the viability of the marginal mean approach for both stepped wedge and parallel cluster randomized trials. We also study the comparative performance of the corrected methods for stepped wedge and parallel designs, and describe how the methods can accommodate interval censoring of individual failure times and incorporate semiparametric efficient estimators.
Collapse
Affiliation(s)
- JoAnna M Scott
- 1 Department of Pediatric Dentistry, University of Washington, Seattle, Washington, USA
| | - Allan deCamp
- 2 Fred Hutchinson Cancer Research Center, Seattle, Washington, USA.,3 Department of Biostatistics, University of Washington, Seattle, Washington, USA
| | - Michal Juraska
- 2 Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Michael P Fay
- 4 Division of Biostatistics, National Institute of Allergies and Infectious Diseases, Bethesda, USA
| | - Peter B Gilbert
- 2 Fred Hutchinson Cancer Research Center, Seattle, Washington, USA.,3 Department of Biostatistics, University of Washington, Seattle, Washington, USA
| |
Collapse
|
41
|
Qin M, Brummel S, Singh KK, Fenton T, Spector SA. Associations of host genetic variants on CD4⁺ lymphocyte count and plasma HIV-1 RNA in antiretroviral naïve children. Pediatr Infect Dis J 2014; 33:946-52. [PMID: 24797997 PMCID: PMC4216611 DOI: 10.1097/inf.0000000000000330] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
BACKGROUND CD4 T-lymphocyte (CD4) counts and HIV plasma RNA concentration (RNA) are 2 key HIV disease markers. The complex interplay between virus and host genetics may contribute to differences in viral set point and CD4 status. Determining the effects of host genetic variation on HIV disease markers is often complicated by the use of antiretroviral therapy. In this study, the association between genetic variants and baseline HIV RNA and CD4 counts was examined in a large cohort of antiretroviral naïve children. METHODS Specimens from 1053 HIV-infected children were screened for single nucleotide polymorphisms in 78 regions from 17 genes. Linear regression with a robust variance estimator was used to test the association between genetic markers with HIV RNA and CD4 count, controlling for age, race/ethnicity and study. False discovery rate (FDR) controlling was used to adjust for multiple testing. RESULTS The study population was 60% black, 26% Hispanic and 13% white; median age 2.35 years; 55% female. Baseline median CD4 count was 780/mm; median log10 HIV RNA was 5.17 copies/mL. For analyses of the associations of genetic makers with baseline CD4 count, 6 HLA and 4 additional markers exhibited P < 0.05, but none met the criteria for statistical significance with FDR controlled at 0.05. For baseline HIV RNA, HLA DRB1*15, DRB1*10, B-27/57, B-14, Cw-8, B-57 were statistically significant with FDR controlled at 0.05. CONCLUSIONS These results provide strong evidence that HLA DRB1*15, DRB1*10, B-27/57, B-14, Cw-8, B-57 are associated with HIV RNA and play a role in HIV pathogenesis in infected children.
Collapse
Affiliation(s)
- Min Qin
- Center for Biostatistics in AIDS Research, Harvard School of Public Health, Boston, MA
| | - Sean Brummel
- Center for Biostatistics in AIDS Research, Harvard School of Public Health, Boston, MA
| | - Kumud K. Singh
- University of California, San Diego, La Jolla, CA and Rady Children’s Hospital, San Diego, CA
| | - Terry Fenton
- Center for Biostatistics in AIDS Research, Harvard School of Public Health, Boston, MA
| | - Stephen A. Spector
- University of California, San Diego, La Jolla, CA and Rady Children’s Hospital, San Diego, CA
| |
Collapse
|
42
|
Zhang Y, Xu Z, Shen X, Pan W. Testing for association with multiple traits in generalized estimation equations, with application to neuroimaging data. Neuroimage 2014; 96:309-25. [PMID: 24704269 PMCID: PMC4043944 DOI: 10.1016/j.neuroimage.2014.03.061] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Revised: 02/14/2014] [Accepted: 03/23/2014] [Indexed: 11/17/2022] Open
Abstract
There is an increasing need to develop and apply powerful statistical tests to detect multiple traits-single locus associations, as arising from neuroimaging genetics and other studies. For example, in the Alzheimer's Disease Neuroimaging Initiative (ADNI), in addition to genome-wide single nucleotide polymorphisms (SNPs), thousands of neuroimaging and neuropsychological phenotypes as intermediate phenotypes for Alzheimer's disease, have been collected. Although some classic methods like MANOVA and newly proposed methods may be applied, they have their own limitations. For example, MANOVA cannot be applied to binary and other discrete traits. In addition, the relationships among these methods are not well understood. Importantly, since these tests are not data adaptive, depending on the unknown association patterns among multiple traits and between multiple traits and a locus, these tests may or may not be powerful. In this paper we propose a class of data-adaptive weights and the corresponding weighted tests in the general framework of generalized estimation equations (GEE). A highly adaptive test is proposed to select the most powerful one from this class of the weighted tests so that it can maintain high power across a wide range of situations. Our proposed tests are applicable to various types of traits with or without covariates. Importantly, we also analytically show relationships among some existing and our proposed tests, indicating that many existing tests are special cases of our proposed tests. Extensive simulation studies were conducted to compare and contrast the power properties of various existing and our new methods. Finally, we applied the methods to an ADNI dataset to illustrate the performance of the methods. We conclude with the recommendation for the use of the GEE-based Score test and our proposed adaptive test for their high and complementary performance.
Collapse
Affiliation(s)
- Yiwei Zhang
- Division of Biostatistics, School of Public Health, Minneapolis, MN 55455, USA
| | - Zhiyuan Xu
- Division of Biostatistics, School of Public Health, Minneapolis, MN 55455, USA
| | - Xiaotong Shen
- School of Statistics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Wei Pan
- Division of Biostatistics, School of Public Health, Minneapolis, MN 55455, USA.
| |
Collapse
|
43
|
Guillaume B, Hua X, Thompson PM, Waldorp L, Nichols TE. Fast and accurate modelling of longitudinal and repeated measures neuroimaging data. Neuroimage 2014; 94:287-302. [PMID: 24650594 PMCID: PMC4073654 DOI: 10.1016/j.neuroimage.2014.03.029] [Citation(s) in RCA: 131] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2013] [Revised: 02/25/2014] [Accepted: 03/10/2014] [Indexed: 02/01/2023] Open
Abstract
Despite the growing importance of longitudinal data in neuroimaging, the standard analysis methods make restrictive or unrealistic assumptions (e.g., assumption of Compound Symmetry--the state of all equal variances and equal correlations--or spatially homogeneous longitudinal correlations). While some new methods have been proposed to more accurately account for such data, these methods are based on iterative algorithms that are slow and failure-prone. In this article, we propose the use of the Sandwich Estimator method which first estimates the parameters of interest with a simple Ordinary Least Square model and second estimates variances/covariances with the "so-called" Sandwich Estimator (SwE) which accounts for the within-subject correlation existing in longitudinal data. Here, we introduce the SwE method in its classic form, and we review and propose several adjustments to improve its behaviour, specifically in small samples. We use intensive Monte Carlo simulations to compare all considered adjustments and isolate the best combination for neuroimaging data. We also compare the SwE method to other popular methods and demonstrate its strengths and weaknesses. Finally, we analyse a highly unbalanced longitudinal dataset from the Alzheimer's Disease Neuroimaging Initiative and demonstrate the flexibility of the SwE method to fit within- and between-subject effects in a single model. Software implementing this SwE method has been made freely available at http://warwick.ac.uk/tenichols/SwE.
Collapse
Affiliation(s)
- Bryan Guillaume
- Cyclotron Research Centre, University of Liège, 4000 Liège, Belgium; Department of Statistics, University of Warwick, Coventry, UK; Global Imaging Unit, GlaxoSmithKline, Stevenage, UK
| | - Xue Hua
- Imaging Genetics Center, Laboratory of Neuro Imaging, Dept. of Neurology & Psychiatry, UCLA School of Medicine, Los Angeles, CA 90095, USA
| | - Paul M Thompson
- Imaging Genetics Center, Laboratory of Neuro Imaging, Dept. of Neurology & Psychiatry, UCLA School of Medicine, Los Angeles, CA 90095, USA
| | - Lourens Waldorp
- Department of Psychological Methods, University of Amsterdam, Amsterdam, The Netherlands
| | - Thomas E Nichols
- Department of Statistics, University of Warwick, Coventry, UK; Warwick Manufacturing Group, University of Warwick, Coventry, UK; Oxford Centre for Functional MRI of the Brain, University of Oxford, Oxford, UK
| |
Collapse
|
44
|
Westgate PM. Improving the correlation structure selection approach for generalized estimating equations and balanced longitudinal data. Stat Med 2014; 33:2222-37. [DOI: 10.1002/sim.6106] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2013] [Revised: 01/07/2014] [Accepted: 01/16/2014] [Indexed: 11/08/2022]
Affiliation(s)
- Philip M. Westgate
- Department of Biostatistics, College of Public Health; University of Kentucky; Lexington KY 40536 U.S.A
| |
Collapse
|
45
|
Stoklosa J, Gibb H, Warton DI. Fast forward selection for generalized estimating equations with a large number of predictor variables. Biometrics 2013; 70:110-20. [DOI: 10.1111/biom.12118] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2013] [Revised: 09/01/2013] [Accepted: 09/01/2013] [Indexed: 11/26/2022]
Affiliation(s)
- Jakub Stoklosa
- School of Mathematics and Statistics and Evolution & Ecology Research Centre; The University of New South Wales; NSW 2052 Australia
| | - Heloise Gibb
- Department of Zoology; La Trobe University; Victoria 3068 Australia
| | - David I. Warton
- School of Mathematics and Statistics and Evolution & Ecology Research Centre; The University of New South Wales; NSW 2052 Australia
| |
Collapse
|
46
|
Wu P, Tu XM, Kowalski J. On assessing model fit for distribution-free longitudinal models under missing data. Stat Med 2013; 33:143-57. [PMID: 23897653 DOI: 10.1002/sim.5908] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2012] [Revised: 04/30/2013] [Accepted: 06/13/2013] [Indexed: 11/10/2022]
Abstract
The generalized estimating equation (GEE), a distribution-free, or semi-parametric, approach for modeling longitudinal data, is used in a wide range of behavioral, psychotherapy, pharmaceutical drug safety, and healthcare-related research studies. Most popular methods for assessing model fit are based on the likelihood function for parametric models, rendering them inappropriate for distribution-free GEE. One rare exception is a score statistic initially proposed by Tsiatis for logistic regression (1980) and later extended by Barnhart and Willamson to GEE (1998). Because GEE only provides valid inference under the missing completely at random assumption and missing values arising in most longitudinal studies do not follow such a restricted mechanism, this GEE-based score test has very limited applications in practice. We propose extensions of this goodness-of-fit test to address missing data under the missing at random assumption, a more realistic model that applies to most studies in practice. We examine the performance of the proposed tests using simulated data and demonstrate the utilities of such tests with data from a real study on geriatric depression and associated medical comorbidities.
Collapse
Affiliation(s)
- P Wu
- Department of Biostatistics and Computational Biology, Rochester, NY, 14623, U.S.A
| | | | | |
Collapse
|
47
|
Westgate PM. On small-sample inference in group randomized trials with binary outcomes and cluster-level covariates. Biom J 2013; 55:789-806. [DOI: 10.1002/bimj.201200237] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Revised: 05/01/2013] [Accepted: 06/01/2013] [Indexed: 11/09/2022]
Affiliation(s)
- Philip M. Westgate
- Department of Biostatistics, College of Public Health; University of Kentucky, 725 Rose Street; Lexington,; KY 40536; USA
| |
Collapse
|
48
|
Westgate PM. A bias correction for covariance estimators to improve inference with generalized estimating equations that use an unstructured correlation matrix. Stat Med 2012; 32:2850-8. [DOI: 10.1002/sim.5709] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Accepted: 11/27/2012] [Indexed: 11/09/2022]
Affiliation(s)
- Philip M. Westgate
- Department of Biostatistics; College of Public Health, University of Kentucky; Lexington KY 40536 U.S.A
| |
Collapse
|
49
|
Yu Q, Chen R, Tang W, He H, Gallop R, Crits-Christoph P, Hu J, Tu XM. Distribution-free models for longitudinal count responses with overdispersion and structural zeros. Stat Med 2012; 32:2390-405. [PMID: 23239019 DOI: 10.1002/sim.5691] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2010] [Accepted: 10/31/2012] [Indexed: 11/10/2022]
Abstract
Overdispersion and structural zeros are two major manifestations of departure from the Poisson assumption when modeling count responses using Poisson log-linear regression. As noted in a large body of literature, ignoring such departures could yield bias and lead to wrong conclusions. Different approaches have been developed to tackle these two major problems. In this paper, we review available methods for dealing with overdispersion and structural zeros within a longitudinal data setting and propose a distribution-free modeling approach to address the limitations of these methods by utilizing a new class of functional response models. We illustrate our approach with both simulated and real study data.
Collapse
Affiliation(s)
- Q Yu
- Department of Biostatistics and Computational Biology, University of Rochester, 601 Elmwoord Ave, Rochester, NY 14642, USA.
| | | | | | | | | | | | | | | |
Collapse
|
50
|
Yue LQ. Asymmetric Effects of Fashions on the Formation and Dissolution of Networks: Board Interlocks with Internet Companies, 1996–2006. ORGANIZATION SCIENCE 2012. [DOI: 10.1287/orsc.1110.0683] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|