Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sauerbrei W, Abrahamowicz M, Altman DG, le Cessie S, Carpenter J, on behalf of the STRATOS initiative. STRengthening analytical thinking for observational studies: the STRATOS initiative. Stat Med 2014;33:5413-32. [PMID: 25074480 PMCID: PMC4320765 DOI: 10.1002/sim.6265] [Citation(s) in RCA: 81] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Revised: 05/28/2014] [Accepted: 06/20/2014] [Indexed: 12/18/2022]

For:	Sauerbrei W, Abrahamowicz M, Altman DG, le Cessie S, Carpenter J, on behalf of the STRATOS initiative. STRengthening analytical thinking for observational studies: the STRATOS initiative. Stat Med 2014;33:5413-32. [PMID: 25074480 PMCID: PMC4320765 DOI: 10.1002/sim.6265] [Citation(s) in RCA: 81] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Revised: 05/28/2014] [Accepted: 06/20/2014] [Indexed: 12/18/2022]

Number

Cited by Other Article(s)

Curtis K, Keogh S, Krishnasamy M, Gough K. Central Venous Access Device Complications and Premature Removal in Patients With Haematological Malignancies: A Multi-Site Cohort Study. EJHAEM 2025;6:e1090. [PMID: 40134700 PMCID: PMC11936046 DOI: 10.1002/jha2.1090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/15/2024] [Revised: 12/03/2024] [Accepted: 12/05/2024] [Indexed: 03/27/2025]

Abstract

Background

Patients with haematological malignancies require urgent and reliable venous access for the administration of systemic anticancer therapies (SACTs) commonly via central venous access devices (CVADs). Disease pathophysiology and side effects of SACTs increase the risk of complications during the dwell time and premature removal. CVAD complications are associated with treatment disruption, increased morbidity and mortality. This study aimed to comprehensively describe CVAD performance over a 12-month period in patients with haematological malignancies.

Methods

A multi-site cohort study at four tertiary hospitals in Melbourne, Australia was undertaken using multidisciplinary data from patient health records and administrative datasets including patient, device, insertion, maintenance, complication and removal data. Cases of interest were CVADs, ascertained using lists provided by the insertion services.

Findings

A total of 1078 CVADs were inserted in 673 patients between 1 September 2020 and 31 August 2021. Of the 1078 CVADs, 197 (18%) remained in situ, and 881 (82%) were removed, of which 369 (42%) were removed prematurely due to infection (n = 208, 57%) and non-infection related reasons (n = 201, 54%). Most CVADs (n = 919, 85%) had documented complications during their dwell time and the proportion of premature removals in these CVADs was over two-fold higher than CVADs with no documented complications. Multivariable Cox regression results indicated that CVAD type, urgency of the procedure, concurrent CVADs and insertion technology were associated with an increased risk of premature removal. Clinical variations in insertion and management care throughout the life of a CVAD and current evidence were identified.

Conclusion

An unacceptably high proportion of CVADs had complications documented during the dwell time and were prematurely removed. Inconsistencies in current evidence and clinical practice highlight opportunities to positively impact CVAD outcomes in this cohort.

Trial Registration

The authors have confirmed clinical trial registration is not needed for this submission.

Collapse

Jones L, Barnett A, Vagenas D. Linear regression reporting practices for health researchers, a cross-sectional meta-research study. PLoS One 2025;20:e0305150. [PMID: 40111967 PMCID: PMC11925299 DOI: 10.1371/journal.pone.0305150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2024] [Accepted: 01/26/2025] [Indexed: 03/22/2025] Open

Abstract

BACKGROUND

Decisions about health care, such as the effectiveness of new treatments for disease, are regularly made based on evidence from published work. However, poor reporting of statistical methods and results is endemic across health research and risks ineffective or harmful treatments being used in clinical practice. Statistical modelling choices often greatly influence the results. Authors do not always provide enough information to evaluate and repeat their methods, making interpreting results difficult. Our research is designed to understand current reporting practices and inform efforts to educate researchers.

METHODS

Reporting practices for linear regression were assessed in 95 randomly sampled published papers in the health field from PLOS ONE in 2019, which were randomly allocated to statisticians for post-publication review. The prevalence of reporting practices is described using frequencies, percentages, and Wilson 95% confidence intervals.

RESULTS

While 92% of authors reported p-values and 81% reported regression coefficients, only 58% of papers reported a measure of uncertainty, such as confidence intervals or standard errors. Sixty-nine percent of authors did not discuss the scientific importance of estimates, and only 23% directly interpreted the size of coefficients.

CONCLUSION

Our results indicate that statistical methods and results were often poorly reported without sufficient detail to reproduce them. To improve statistical quality and direct health funding to effective treatments, we recommend that statisticians be involved in the research cycle, from study design to post-peer review. The research environment is an ecosystem, and future interventions addressing poor statistical quality should consider the interactions between the individuals, organisations and policy environments. Practical recommendations include journals producing templates with standardised reporting and using interactive checklists to improve reporting practices. Investments in research maintenance and quality control are required to assess and implement these recommendations to improve the quality of health research.

Collapse

Abrahamowicz M, Beauchamp ME, Boulesteix AL, Morris TP, Sauerbrei W, Kaufman JS, STRATOS Simulation Panel OBOT. Data-driven simulations to assess the impact of study imperfections in time-to-event analyses. Am J Epidemiol 2025;194:233-242. [PMID: 38717330 PMCID: PMC7617302 DOI: 10.1093/aje/kwae058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Revised: 04/05/2024] [Accepted: 04/25/2024] [Indexed: 01/16/2025] Open

Speiser JL, Kerr WT, Ziegler A. Common Critiques and Recommendations for Studies in Neurology Using Machine Learning Methods. Neurology 2024;103:e209861. [PMID: 39236270 PMCID: PMC11379123 DOI: 10.1212/wnl.0000000000209861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Accepted: 07/11/2024] [Indexed: 09/07/2024] Open

Abstract

Machine learning (ML) methods are becoming more prevalent in the neurology literature as alternatives to traditional statistical methods to address challenges in the analysis of modern data sets. Despite the increase in the popularity of ML methods in neurology studies, some authors do not fully address all items recommended in reporting guidelines. The authors of this Research Methods article are members of the Neurology® editorial board and have reviewed many studies using ML methods. In their review reports, several critiques often appear, which could be avoided if guidance were available. In this article, we detail common critiques found in ML research studies and make recommendations for how to avoid them. The first critique involves misalignment of the study goals and the analysis conducted. The second critique focuses on ML terminology being appropriately used. Critiques 3-6 are related to the study design: justifying sample sizes and the suitability of the data set for the study goals, describing the ML analysis pipeline sufficiently, quantifying the amount of missing data and providing information about missing data handling, and including uncertainty estimates for key metrics. The seventh critique focuses on fairly describing both strengths and limitations of the ML study, including the analysis methodology and results. We provide examples in neurology for each critique and guidance on how to avoid the critique. Overall, we recommend that authors use ML-specific checklists developed by research consortia for designing and reporting studies using ML. We also recommend that authors involve both a statistician and an ML expert in work that uses ML. Although our list of critiques is not exhaustive, our recommendations should help improve the quality and rigor of ML studies. ML has great potential to revolutionize neurology, but investigators need to conduct and report the results in a way that allows readers to fully evaluate the benefits and limitations of ML approaches.

Collapse

Losciale JM, Truong LK, Ward P, Collins GS, Bullock GS. Limitations of Separating Athletes into High or Low-Risk Groups based on a Cut-Off. A Clinical Commentary. Int J Sports Phys Ther 2024;19:1151-1164. [PMID: 39229450 PMCID: PMC11368444 DOI: 10.26603/001c.122644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 07/19/2024] [Indexed: 09/05/2024] Open

Abstract

Background

Athlete injury risk assessment and management is an important, yet challenging task for sport and exercise medicine professionals. A common approach to injury risk screening is to stratify athletes into risk groups based on their performance on a test relative to a cut-off threshold. However, one potential reason for ineffective injury prevention efforts is the over-reliance on identifying these 'at-risk' groups using arbitrary cut-offs for these tests and measures. The purpose of this commentary is to discuss the conceptual and technical issues related to the use of a cut-off in both research and clinical practice.

Clinical Question

How can we better assess and interpret clinical tests or measures to enable a more effective injury risk assessment in athletes?

Key Results

Cut-offs typically lack strong biologic plausibility to support them; and are typically derived in a data-driven manner and thus not generalizable to other samples. When a cut-off is used in analyses, information is lost, leading to potentially misleading results and less accurate injury risk prediction. Dichotomizing a continuous variable using a cut-off should be avoided. Using continuous variables on its original scale is advantageous because information is not discarded, outcome prediction accuracy is not lost, and personalized medicine can be facilitated.

Clinical Application

Researchers and clinicians are encouraged to analyze and interpret the results of tests and measures using continuous variables and avoid relying on singular cut-offs to guide decisions. Injury risk can be predicted more accurately when using continuous variables in their natural form. A more accurate risk prediction will facilitate personalized approaches to injury risk mitigation and may lead to a decline in injury rates.

Level of Evidence

Collapse

Ullmann T, Heinze G, Hafermann L, Schilhart-Wallisch C, Dunkler D, for TG2 of the STRATOS initiative. Evaluating variable selection methods for multivariable regression models: A simulation study protocol. PLoS One 2024;19:e0308543. [PMID: 39121055 PMCID: PMC11315300 DOI: 10.1371/journal.pone.0308543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 07/25/2024] [Indexed: 08/11/2024] Open

Abstract

Researchers often perform data-driven variable selection when modeling the associations between an outcome and multiple independent variables in regression analysis. Variable selection may improve the interpretability, parsimony and/or predictive accuracy of a model. Yet variable selection can also have negative consequences, such as false exclusion of important variables or inclusion of noise variables, biased estimation of regression coefficients, underestimated standard errors and invalid confidence intervals, as well as model instability. While the potential advantages and disadvantages of variable selection have been discussed in the literature for decades, few large-scale simulation studies have neutrally compared data-driven variable selection methods with respect to their consequences for the resulting models. We present the protocol for a simulation study that will evaluate different variable selection methods: forward selection, stepwise forward selection, backward elimination, augmented backward elimination, univariable selection, univariable selection followed by backward elimination, and penalized likelihood approaches (Lasso, relaxed Lasso, adaptive Lasso). These methods will be compared with respect to false inclusion and/or exclusion of variables, consequences on bias and variance of the estimated regression coefficients, the validity of the confidence intervals for the coefficients, the accuracy of the estimated variable importance ranking, and the predictive performance of the selected models. We consider both linear and logistic regression in a low-dimensional setting (20 independent variables with 10 true predictors and 10 noise variables). The simulation will be based on real-world data from the National Health and Nutrition Examination Survey (NHANES). Publishing this study protocol ahead of performing the simulation increases transparency and allows integrating the perspective of other experts into the study design.

Collapse

Lusa L, Proust-Lima C, Schmidt CO, Lee KJ, le Cessie S, Baillie M, Lawrence F, Huebner M, on behalf of TG3 of the STRATOS Initiative. Initial data analysis for longitudinal studies to build a solid foundation for reproducible analysis. PLoS One 2024;19:e0295726. [PMID: 38809844 PMCID: PMC11135704 DOI: 10.1371/journal.pone.0295726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 03/13/2024] [Indexed: 05/31/2024] Open

Abstract

Initial data analysis (IDA) is the part of the data pipeline that takes place between the end of data retrieval and the beginning of data analysis that addresses the research question. Systematic IDA and clear reporting of the IDA findings is an important step towards reproducible research. A general framework of IDA for observational studies includes data cleaning, data screening, and possible updates of pre-planned statistical analyses. Longitudinal studies, where participants are observed repeatedly over time, pose additional challenges, as they have special features that should be taken into account in the IDA steps before addressing the research question. We propose a systematic approach in longitudinal studies to examine data properties prior to conducting planned statistical analyses. In this paper we focus on the data screening element of IDA, assuming that the research aims are accompanied by an analysis plan, meta-data are well documented, and data cleaning has already been performed. IDA data screening comprises five types of explorations, covering the analysis of participation profiles over time, evaluation of missing data, presentation of univariate and multivariate descriptions, and the depiction of longitudinal aspects. Executing the IDA plan will result in an IDA report to inform data analysts about data properties and possible implications for the analysis plan-another element of the IDA framework. Our framework is illustrated focusing on hand grip strength outcome data from a data collection across several waves in a complex survey. We provide reproducible R code on a public repository, presenting a detailed data screening plan for the investigation of the average rate of age-associated decline of grip strength. With our checklist and reproducible R code we provide data analysts a framework to work with longitudinal data in an informed way, enhancing the reproducibility and validity of their work.

Collapse

Huebner M, Bond L, Stukes F, Herndon J, Edwards DJ, Pomann GM. Developing partnerships for academic data science consulting and collaboration units. Stat (Int Stat Inst) 2024;13:e644. [PMID: 39238953 PMCID: PMC11376992 DOI: 10.1002/sta4.644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Accepted: 12/05/2023] [Indexed: 09/07/2024]

Heinze G, Boulesteix AL, Kammer M, Morris TP, White IR. Phases of methodological research in biostatistics-Building the evidence base for new methods. Biom J 2024;66:e2200222. [PMID: 36737675 PMCID: PMC7615508 DOI: 10.1002/bimj.202200222] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 12/09/2022] [Accepted: 01/22/2023] [Indexed: 02/05/2023]

Kelter R. The Bayesian simulation study (BASIS) framework for simulation studies in statistical and methodological research. Biom J 2024;66:e2200095. [PMID: 36642811 DOI: 10.1002/bimj.202200095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 12/07/2022] [Accepted: 12/10/2022] [Indexed: 01/17/2023]

Arnes JI, Hapfelmeier A, Horsch A, Braaten T. Greedy knot selection algorithm for restricted cubic spline regression. FRONTIERS IN EPIDEMIOLOGY 2023;3:1283705. [PMID: 38455941 PMCID: PMC10910934 DOI: 10.3389/fepid.2023.1283705] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Accepted: 11/17/2023] [Indexed: 03/09/2024]

Schmid M, Friede T, Klein N, Weinhold L. Accounting for time dependency in meta-analyses of concordance probability estimates. Res Synth Methods 2023;14:807-823. [PMID: 37429580 DOI: 10.1002/jrsm.1655] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 04/21/2023] [Accepted: 06/19/2023] [Indexed: 07/12/2023]

Ma J, Dhiman P, Qi C, Bullock G, van Smeden M, Riley RD, Collins GS. Poor handling of continuous predictors in clinical prediction models using logistic regression: a systematic review. J Clin Epidemiol 2023;161:140-151. [PMID: 37536504 PMCID: PMC11913776 DOI: 10.1016/j.jclinepi.2023.07.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 07/20/2023] [Accepted: 07/26/2023] [Indexed: 08/05/2023]

Boe LA, Shaw PA, Midthune D, Gustafson P, Kipnis V, Park E, Sotres-Alvarez D, Freedman L, of the STRATOS Initiative OBOTMEAMTG(TG. Issues in Implementing Regression Calibration Analyses. Am J Epidemiol 2023;192:1406-1414. [PMID: 37092245 PMCID: PMC10666971 DOI: 10.1093/aje/kwad098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 02/27/2023] [Accepted: 04/13/2023] [Indexed: 04/25/2023] Open

Rahnenführer J, De Bin R, Benner A, Ambrogi F, Lusa L, Boulesteix AL, Migliavacca E, Binder H, Michiels S, Sauerbrei W, McShane L. Statistical analysis of high-dimensional biomedical data: a gentle introduction to analytical goals, common approaches and challenges. BMC Med 2023;21:182. [PMID: 37189125 DOI: 10.1186/s12916-023-02858-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 04/03/2023] [Indexed: 05/17/2023] Open

Abstract

BACKGROUND

In high-dimensional data (HDD) settings, the number of variables associated with each observation is very large. Prominent examples of HDD in biomedical research include omics data with a large number of variables such as many measurements across the genome, proteome, or metabolome, as well as electronic health records data that have large numbers of variables recorded for each patient. The statistical analysis of such data requires knowledge and experience, sometimes of complex methods adapted to the respective research questions.

METHODS

Advances in statistical methodology and machine learning methods offer new opportunities for innovative analyses of HDD, but at the same time require a deeper understanding of some fundamental statistical concepts. Topic group TG9 "High-dimensional data" of the STRATOS (STRengthening Analytical Thinking for Observational Studies) initiative provides guidance for the analysis of observational studies, addressing particular statistical challenges and opportunities for the analysis of studies involving HDD. In this overview, we discuss key aspects of HDD analysis to provide a gentle introduction for non-statisticians and for classically trained statisticians with little experience specific to HDD.

RESULTS

The paper is organized with respect to subtopics that are most relevant for the analysis of HDD, in particular initial data analysis, exploratory data analysis, multiple testing, and prediction. For each subtopic, main analytical goals in HDD settings are outlined. For each of these goals, basic explanations for some commonly used analysis methods are provided. Situations are identified where traditional statistical methods cannot, or should not, be used in the HDD setting, or where adequate analytic tools are still lacking. Many key references are provided.

CONCLUSIONS

This review aims to provide a solid statistical foundation for researchers, including statisticians and non-statisticians, who are new to research with HDD or simply want to better evaluate and understand the results of HDD analyses.

Collapse

Sauerbrei W, Kipruto E, Balmford J. Effects of influential points and sample size on the selection and replicability of multivariable fractional polynomial models. Diagn Progn Res 2023;7:7. [PMID: 37069621 PMCID: PMC10111698 DOI: 10.1186/s41512-023-00145-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 01/27/2023] [Indexed: 04/19/2023] Open

Abstract

BACKGROUND

The multivariable fractional polynomial (MFP) approach combines variable selection using backward elimination with a function selection procedure (FSP) for fractional polynomial (FP) functions. It is a relatively simple approach which can be easily understood without advanced training in statistical modeling. For continuous variables, a closed test procedure is used to decide between no effect, linear, FP1, or FP2 functions. Influential points (IPs) and small sample sizes can both have a strong impact on a selected function and MFP model.

METHODS

We used simulated data with six continuous and four categorical predictors to illustrate approaches which can help to identify IPs with an influence on function selection and the MFP model. Approaches use leave-one or two-out and two related techniques for a multivariable assessment. In eight subsamples, we also investigated the effects of sample size and model replicability, the latter by using three non-overlapping subsamples with the same sample size. For better illustration, a structured profile was used to provide an overview of all analyses conducted.

RESULTS

The results showed that one or more IPs can drive the functions and models selected. In addition, with a small sample size, MFP was not able to detect some non-linear functions and the selected model differed substantially from the true underlying model. However, when the sample size was relatively large and regression diagnostics were carefully conducted, MFP selected functions or models that were similar to the underlying true model.

CONCLUSIONS

For smaller sample size, IPs and low power are important reasons that the MFP approach may not be able to identify underlying functional relationships for continuous variables and selected models might differ substantially from the true model. However, for larger sample sizes, a carefully conducted MFP analysis is often a suitable way to select a multivariable regression model which includes continuous variables. In such a case, MFP can be the preferred approach to derive a multivariable descriptive model.

Collapse

Friedrich S, Groll A, Ickstadt K, Kneib T, Pauly M, Rahnenführer J, Friede T. Regularization approaches in clinical biostatistics: A review of methods and their applications. Stat Methods Med Res 2023;32:425-440. [PMID: 36384320 PMCID: PMC9896544 DOI: 10.1177/09622802221133557] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

McLernon DJ, Giardiello D, Van Calster B, Wynants L, van Geloven N, van Smeden M, Therneau T, Steyerberg EW. Assessing Performance and Clinical Usefulness in Prediction Models With Survival Outcomes: Practical Guidance for Cox Proportional Hazards Models. Ann Intern Med 2023;176:105-114. [PMID: 36571841 DOI: 10.7326/m22-0844] [Citation(s) in RCA: 58] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open

Jørgensen CK, Olsen MH, Nielsen N, Lange T, Mbuagbaw L, Thabane L, Billot L, Binder N, Garattini S, Banzi R, Demotes J, Biagioli E, Rulli E, Bertolini G, Nattino G, Mathiesen O, Torri V, Gluud C, Jakobsen JC. Centre for Statistical and Methodological Excellence (CESAME): A Consortium Initiative for Improving Methodology in Randomised Clinical Trials. Health Serv Insights 2023;16:11786329231166519. [PMID: 37077323 PMCID: PMC10107963 DOI: 10.1177/11786329231166519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 03/13/2023] [Indexed: 04/21/2023] Open

Affiliation(s)

Caroline Kamp Jørgensen Copenhagen Trial Unit, Centre for Clinical Intervention Research, The Capital Region, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark Department of Regional Health Research, The Faculty of Health Sciences, University of Southern Denmark, Odense, Denmark Caroline Kamp Jørgensen, Copenhagen Trial Unit, Centre for Clinical Intervention Research, The Capital Region, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Blegdamsvej 9, Kobenhavn 2100, Denmark.
Markus Harboe Olsen Copenhagen Trial Unit, Centre for Clinical Intervention Research, The Capital Region, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark Department of Neuroanaesthesiology, the Neuroscience Centre, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
Niklas Nielsen Department of Clinical Sciences, Faculty of Medicine, Lund University, Sweden
Theis Lange Department of Public Health/Section of Biostatistics, Copenhagen University, Copenhagen, Denmark
Lawrence Mbuagbaw Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada Biostatistics Unit, St Joseph’s Healthcare Hamilton, Hamilton ON, Canada
Lehana Thabane Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada Biostatistics Unit, St Joseph’s Healthcare Hamilton, Hamilton ON, Canada Health Faculty of Health Sciences, University of Johannesburg, Johannesburg, South Africa
Laurent Billot The George Institute for Global Health, University of New South Wales, Sydney, NSW, Australia
Nadine Binder Department of Data Driven Medicine, Institute of General Practice/Family Medicine, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
Silvio Garattini Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
Rita Banzi Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
Jacques Demotes ECRIN European Clinical Research Infrastructure Network, Paris, France
Elena Biagioli Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
Eliana Rulli Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
Guido Bertolini Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
Giovanni Nattino Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
Ole Mathiesen Centre for Anaesthesiological Research, Department of Anaesthesiology, Zealand University Hospital, Køge, Denmark Department of Clincal Medicine, Copenhagen University, Copenhagen, Denmark
Valter Torri Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
Christian Gluud Copenhagen Trial Unit, Centre for Clinical Intervention Research, The Capital Region, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark Department of Regional Health Research, The Faculty of Health Sciences, University of Southern Denmark, Odense, Denmark
Janus Christian Jakobsen Copenhagen Trial Unit, Centre for Clinical Intervention Research, The Capital Region, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark Department of Regional Health Research, The Faculty of Health Sciences, University of Southern Denmark, Odense, Denmark

Collapse

Comparison of variable selection procedures and investigation of the role of shrinkage in linear regression-protocol of a simulation study in low-dimensional data. PLoS One 2022;17:e0271240. [PMID: 36191290 PMCID: PMC9529280 DOI: 10.1371/journal.pone.0271240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/24/2022] [Indexed: 11/06/2022] Open

Gravesteijn BY, Steyerberg EW, Lingsma HF. Modern Learning from Big Data in Critical Care: Primum Non Nocere. Neurocrit Care 2022;37:174-184. [PMID: 35513752 PMCID: PMC9071245 DOI: 10.1007/s12028-022-01510-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 04/06/2022] [Indexed: 12/13/2022]

Abstract

Large and complex data sets are increasingly available for research in critical care. To analyze these data, researchers use techniques commonly referred to as statistical learning or machine learning (ML). The latter is known for large successes in the field of diagnostics, for example, by identification of radiological anomalies. In other research areas, such as clustering and prediction studies, there is more discussion regarding the benefit and efficiency of ML techniques compared with statistical learning. In this viewpoint, we aim to explain commonly used statistical learning and ML techniques and provide guidance for responsible use in the case of clustering and prediction questions in critical care. Clustering studies have been increasingly popular in critical care research, aiming to inform how patients can be characterized, classified, or treated differently. An important challenge for clustering studies is to ensure and assess generalizability. This limits the application of findings in these studies toward individual patients. In the case of predictive questions, there is much discussion as to what algorithm should be used to most accurately predict outcome. Aspects that determine usefulness of ML, compared with statistical techniques, include the volume of the data, the dimensionality of the preferred model, and the extent of missing data. There are areas in which modern ML methods may be preferred. However, efforts should be made to implement statistical frameworks (e.g., for dealing with missing data or measurement error, both omnipresent in clinical data) in ML methods. To conclude, there are important opportunities but also pitfalls to consider when performing clustering or predictive studies with ML techniques. We advocate careful valuation of new data-driven findings. More interaction is needed between the engineer mindset of experts in ML methods, the insight in bias of epidemiologists, and the probabilistic thinking of statisticians to extract as much information and knowledge from data as possible, while avoiding harm.

Collapse

Dwivedi AK. How to write statistical analysis section in medical research. J Investig Med 2022;70:1759-1770. [PMID: 35710142 DOI: 10.1136/jim-2022-002479] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/01/2022] [Indexed: 12/15/2022]

Abstract

Reporting of statistical analysis is essential in any clinical and translational research study. However, medical research studies sometimes report statistical analysis that is either inappropriate or insufficient to attest to the accuracy and validity of findings and conclusions. Published works involving inaccurate statistical analyses and insufficient reporting influence the conduct of future scientific studies, including meta-analyses and medical decisions. Although the biostatistical practice has been improved over the years due to the involvement of statistical reviewers and collaborators in research studies, there remain areas of improvement for transparent reporting of the statistical analysis section in a study. Evidence-based biostatistics practice throughout the research is useful for generating reliable data and translating meaningful data to meaningful interpretation and decisions in medical research. Most existing research reporting guidelines do not provide guidance for reporting methods in the statistical analysis section that helps in evaluating the quality of findings and data interpretation. In this report, we highlight the global and critical steps to be reported in the statistical analysis of grants and research articles. We provide clarity and the importance of understanding study objective types, data generation process, effect size use, evidence-based biostatistical methods use, and development of statistical models through several thematic frameworks. We also provide published examples of adherence or non-adherence to methodological standards related to each step in the statistical analysis and their implications. We believe the suggestions provided in this report can have far-reaching implications for education and strengthening the quality of statistical reporting and biostatistical practice in medical research.

Collapse

van Geloven N, Giardiello D, Bonneville EF, Teece L, Ramspek CL, van Smeden M, Snell KIE, van Calster B, Pohar-Perme M, Riley RD, Putter H, Steyerberg E. Validation of prediction models in the presence of competing risks: a guide through modern methods. BMJ 2022;377:e069249. [PMID: 35609902 DOI: 10.1136/bmj-2021-069249] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Sauerbrei W, Haeussler T, Balmford J, Huebner M. Structured reporting to improve transparency of analyses in prognostic marker studies. BMC Med 2022;20:184. [PMID: 35546237 PMCID: PMC9095054 DOI: 10.1186/s12916-022-02304-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Accepted: 02/17/2022] [Indexed: 11/24/2022] Open

Abstract

BACKGROUND

Factors contributing to the lack of understanding of research studies include poor reporting practices, such as selective reporting of statistically significant findings or insufficient methodological details. Systematic reviews have shown that prognostic factor studies continue to be poorly reported, even for important aspects, such as the effective sample size. The REMARK reporting guidelines support researchers in reporting key aspects of tumor marker prognostic studies. The REMARK profile was proposed to augment these guidelines to aid in structured reporting with an emphasis on including all aspects of analyses conducted.

METHODS

A systematic search of prognostic factor studies was conducted, and fifteen studies published in 2015 were selected, three from each of five oncology journals. A paper was eligible for selection if it included survival outcomes and multivariable models were used in the statistical analyses. For each study, we summarized the key information in a REMARK profile consisting of details about the patient population with available variables and follow-up data, and a list of all analyses conducted.

RESULTS

Structured profiles allow an easy assessment if reporting of a study only has weaknesses or if it is poor because many relevant details are missing. Studies had incomplete reporting of exclusion of patients, missing information about the number of events, or lacked details about statistical analyses, e.g., subgroup analyses in small populations without any information about the number of events. Profiles exhibit severe weaknesses in the reporting of more than 50% of the studies. The quality of analyses was not assessed, but some profiles exhibit several deficits at a glance.

CONCLUSIONS

A substantial part of prognostic factor studies is poorly reported and analyzed, with severe consequences for related systematic reviews and meta-analyses. We consider inadequate reporting of single studies as one of the most important reasons that the clinical relevance of most markers is still unclear after years of research and dozens of publications. We conclude that structured reporting is an important step to improve the quality of prognostic marker research and discuss its role in the context of selective reporting, meta-analysis, study registration, predefined statistical analysis plans, and improvement of marker research.

Collapse

Kuss O, Becher H, Wienke A, Ittermann T, Ostrzinski S, Schipf S, Schmidt CO, Leitzmann M, Pischon T, Krist L, Roll S, Sand M, Pohlabeln H, Rach S, Jöckel KH, Stang A, Mueller UA, Werdecker A, Westerman R, Greiser KH, Michels KB. Statistical Analysis in the German National Cohort (NAKO) - Specific Aspects and General Recommendations. Eur J Epidemiol 2022;37:429-436. [PMID: 35653006 PMCID: PMC9187540 DOI: 10.1007/s10654-022-00880-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 05/05/2022] [Indexed: 11/03/2022]

Affiliation(s)

Oliver Kuss Institute for Biometrics and Epidemiology, German Diabetes Center, Leibniz Institute for Diabetes Research, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany. Centre for Health and Society, Medical Faculty, University Hospital Düsseldorf, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany.
Heiko Becher Institute for Medical Biometry and Epidemiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
Andreas Wienke Institute of Medical Epidemiology, Biometrics, and Informatics, Martin-Luther-University Halle-Wittenberg, Halle, Germany
Till Ittermann Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
Stefan Ostrzinski Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
Sabine Schipf Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
Carsten O Schmidt Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
Michael Leitzmann Department of Epidemiology and Preventive Medicine, Regensburg University Medical Center, Regensburg, Germany
Tobias Pischon Molecular Epidemiology Research Group, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
Lilian Krist Institute of Social Medicine, Epidemiology and Health Economics, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Stephanie Roll Institute of Social Medicine, Epidemiology and Health Economics, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Matthias Sand GESIS - Leibniz-Institute for Social Sciences, Mannheim, Germany
Hermann Pohlabeln Leibniz Institute for Prevention Research and Epidemiology - BIPS, Bremen, Germany
Stefan Rach Leibniz Institute for Prevention Research and Epidemiology - BIPS, Bremen, Germany
Karl-Heinz Jöckel Institute for Medical Informatics, Biometry and Epidemiology, University Hospital Essen, Essen, Germany
Andreas Stang Institute for Medical Informatics, Biometry and Epidemiology, University Hospital Essen, Essen, Germany
Ulrich A Mueller Federal Institute for Population Research, Wiesbaden, Germany
Andrea Werdecker Federal Institute for Population Research, Wiesbaden, Germany
Ronny Westerman Federal Institute for Population Research, Wiesbaden, Germany
Karin H Greiser Division of Cancer Epidemiology, DKFZ Heidelberg, Heidelberg, Germany
Karin B Michels Institute for Prevention and Cancer Epidemiology, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles, California, USA

Collapse

Elhakeem A, Hughes RA, Tilling K, Cousminer DL, Jackowski SA, Cole TJ, Kwong ASF, Li Z, Grant SFA, Baxter-Jones ADG, Zemel BS, Lawlor DA. Using linear and natural cubic splines, SITAR, and latent trajectory models to characterise nonlinear longitudinal growth trajectories in cohort studies. BMC Med Res Methodol 2022;22:68. [PMID: 35291947 PMCID: PMC8925070 DOI: 10.1186/s12874-022-01542-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 02/11/2022] [Indexed: 11/23/2022] Open

Abstract

BACKGROUND

Longitudinal data analysis can improve our understanding of the influences on health trajectories across the life-course. There are a variety of statistical models which can be used, and their fitting and interpretation can be complex, particularly where there is a nonlinear trajectory. Our aim was to provide an accessible guide along with applied examples to using four sophisticated modelling procedures for describing nonlinear growth trajectories.

METHODS

This expository paper provides an illustrative guide to summarising nonlinear growth trajectories for repeatedly measured continuous outcomes using (i) linear spline and (ii) natural cubic spline linear mixed-effects (LME) models, (iii) Super Imposition by Translation and Rotation (SITAR) nonlinear mixed effects models, and (iv) latent trajectory models. The underlying model for each approach, their similarities and differences, and their advantages and disadvantages are described. Their application and correct interpretation of their results is illustrated by analysing repeated bone mass measures to characterise bone growth patterns and their sex differences in three cohort studies from the UK, USA, and Canada comprising 8500 individuals and 37,000 measurements from ages 5-40 years. Recommendations for choosing a modelling approach are provided along with a discussion and signposting on further modelling extensions for analysing trajectory exposures and outcomes, and multiple cohorts.

RESULTS

Linear and natural cubic spline LME models and SITAR provided similar summary of the mean bone growth trajectory and growth velocity, and the sex differences in growth patterns. Growth velocity (in grams/year) peaked during adolescence, and peaked earlier in females than males e.g., mean age at peak bone mineral content accrual from multicohort SITAR models was 12.2 years in females and 13.9 years in males. Latent trajectory models (with trajectory shapes estimated using a natural cubic spline) identified up to four subgroups of individuals with distinct trajectories throughout adolescence.

CONCLUSIONS

LME models with linear and natural cubic splines, SITAR, and latent trajectory models are useful for describing nonlinear growth trajectories, and these methods can be adapted for other complex traits. Choice of method depends on the research aims, complexity of the trajectory, and available data. Scripts and synthetic datasets are provided for readers to replicate trajectory modelling and visualisation using the R statistical computing software.

Collapse

Affiliation(s)

Ahmed Elhakeem MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK. Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.
Rachael A Hughes MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
Kate Tilling MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
Diana L Cousminer Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA, USA Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Stefan A Jackowski College of Kinesiology, University of Saskatchewan, Saskatoon, Saskatchewan, Canada Children's Hospital of Eastern Ontario Research Institute, Ottawa, Ontario, Canada
Tim J Cole UCL Great Ormond Street Institute of Child Health, London, UK
Alex S F Kwong MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK Division of Psychiatry, Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, UK
Zheyuan Li School of Mathematics and Statistics, Henan University, Kaifeng, Henan, China Department of Statistics and Actuarial Sciences, Simon Fraser University, Burnaby, BC, Canada
Struan F A Grant Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA, USA Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, USA Department of Pediatrics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA Division of Endocrinology and Diabetes, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Adam D G Baxter-Jones College of Kinesiology, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
Babette S Zemel Department of Pediatrics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA Division of Gastroenterology, Hepatology and Nutrition, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Deborah A Lawlor MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK

Collapse

Modeling pulse wave velocity trajectories—challenges, opportunities, and pitfalls. Kidney Int 2022;101:459-462. [DOI: 10.1016/j.kint.2021.12.025] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 12/21/2021] [Indexed: 01/08/2023]

Wallisch C, Bach P, Hafermann L, Klein N, Sauerbrei W, Steyerberg EW, Heinze G, Rauch G, on behalf of topic group 2 of the STRATOS initiative. Review of guidance papers on regression modeling in statistical series of medical journals. PLoS One 2022;17:e0262918. [PMID: 35073384 PMCID: PMC8786189 DOI: 10.1371/journal.pone.0262918] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 01/08/2022] [Indexed: 12/23/2022] Open

Abstract

Although regression models play a central role in the analysis of medical research projects, there still exist many misconceptions on various aspects of modeling leading to faulty analyses. Indeed, the rapidly developing statistical methodology and its recent advances in regression modeling do not seem to be adequately reflected in many medical publications. This problem of knowledge transfer from statistical research to application was identified by some medical journals, which have published series of statistical tutorials and (shorter) papers mainly addressing medical researchers. The aim of this review was to assess the current level of knowledge with regard to regression modeling contained in such statistical papers. We searched for target series by a request to international statistical experts. We identified 23 series including 57 topic-relevant articles. Within each article, two independent raters analyzed the content by investigating 44 predefined aspects on regression modeling. We assessed to what extent the aspects were explained and if examples, software advices, and recommendations for or against specific methods were given. Most series (21/23) included at least one article on multivariable regression. Logistic regression was the most frequently described regression type (19/23), followed by linear regression (18/23), Cox regression and survival models (12/23) and Poisson regression (3/23). Most general aspects on regression modeling, e.g. model assumptions, reporting and interpretation of regression results, were covered. We did not find many misconceptions or misleading recommendations, but we identified relevant gaps, in particular with respect to addressing nonlinear effects of continuous predictors, model specification and variable selection. Specific recommendations on software were rarely given. Statistical guidance should be developed for nonlinear effects, model specification and variable selection to better support medical researchers who perform or interpret regression analyses.

Collapse

Teerawattananon Y, Anothaisintawee T, Pheerapanyawaranun C, Botwright S, Akksilp K, Sirichumroonwit N, Budtarad N, Isaranuwatchai W. A systematic review of methodological approaches for evaluating real-world effectiveness of COVID-19 vaccines: Advising resource-constrained settings. PLoS One 2022;17:e0261930. [PMID: 35015761 PMCID: PMC8752025 DOI: 10.1371/journal.pone.0261930] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 12/13/2021] [Indexed: 01/19/2023] Open

Abstract

Real-world effectiveness studies are important for monitoring performance of COVID-19 vaccination programmes and informing COVID-19 prevention and control policies. We aimed to synthesise methodological approaches used in COVID-19 vaccine effectiveness studies, in order to evaluate which approaches are most appropriate to implement in low- and middle-income countries (LMICs). For this rapid systematic review, we searched PubMed and Scopus for articles published from inception to July 7, 2021, without language restrictions. We included any type of peer-reviewed observational study measuring COVID-19 vaccine effectiveness, for any population. We excluded randomised control trials and modelling studies. All data used in the analysis were extracted from included papers. We used a standardised data extraction form, modified from STrengthening the Reporting of OBservational studies in Epidemiology (STROBE). Study quality was assessed using the REal Life EVidence AssessmeNt Tool (RELEVANT) tool. This study is registered with PROSPERO, CRD42021264658. Our search identified 3,327 studies, of which 42 were eligible for analysis. Most studies (97.5%) were conducted in high-income countries and the majority assessed mRNA vaccines (78% mRNA only, 17% mRNA and viral vector, 2.5% viral vector, 2.5% inactivated vaccine). Thirty-five of the studies (83%) used a cohort study design. Across studies, short follow-up time and limited assessment and mitigation of potential confounders, including previous SARS-CoV-2 infection and healthcare seeking behaviour, were major limitations. This review summarises methodological approaches for evaluating real-world effectiveness of COVID-19 vaccines and highlights the lack of such studies in LMICs, as well as the importance of context-specific vaccine effectiveness data. Further research in LMICs will refine guidance for conducting real-world COVID-19 vaccine effectiveness studies in resource-constrained settings.

Collapse

Haneef R, Tijhuis M, Thiébaut R, Májek O, Pristaš I, Tolenan H, Gallay A. Methodological guidelines to estimate population-based health indicators using linked data and/or machine learning techniques. Arch Public Health 2022;80:9. [PMID: 34983651 PMCID: PMC8725299 DOI: 10.1186/s13690-021-00770-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 12/17/2021] [Indexed: 12/23/2022] Open

Abstract

BACKGROUND

The capacity to use data linkage and artificial intelligence to estimate and predict health indicators varies across European countries. However, the estimation of health indicators from linked administrative data is challenging due to several reasons such as variability in data sources and data collection methods resulting in reduced interoperability at various levels and timeliness, availability of a large number of variables, lack of skills and capacity to link and analyze big data. The main objective of this study is to develop the methodological guidelines calculating population-based health indicators to guide European countries using linked data and/or machine learning (ML) techniques with new methods.

METHOD

We have performed the following step-wise approach systematically to develop the methodological guidelines: i. Scientific literature review, ii. Identification of inspiring examples from European countries, and iii. Developing the checklist of guidelines contents.

RESULTS

We have developed the methodological guidelines, which provide a systematic approach for studies using linked data and/or ML-techniques to produce population-based health indicators. These guidelines include a detailed checklist of the following items: rationale and objective of the study (i.e., research question), study design, linked data sources, study population/sample size, study outcomes, data preparation, data analysis (i.e., statistical techniques, sensitivity analysis and potential issues during data analysis) and study limitations.

CONCLUSIONS

This is the first study to develop the methodological guidelines for studies focused on population health using linked data and/or machine learning techniques. These guidelines would support researchers to adopt and develop a systematic approach for high-quality research methods. There is a need for high-quality research methodologies using more linked data and ML-techniques to develop a structured cross-disciplinary approach for improving the population health information and thereby the population health.

Collapse

Can we use existing guidance to support the development of robust real-world evidence for health technology assessment/payer decision-making? Int J Technol Assess Health Care 2022;38:e79. [DOI: 10.1017/s0266462322000605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Farjat AE, Virdone S, Thomas LE, Kakkar AK, Pieper KS, Piccini JP. The importance of the design of observational studies in comparative effectiveness research: Lessons from the GARFIELD-AF and ORBIT-AF registries. Am Heart J 2022;243:110-121. [PMID: 34529945 DOI: 10.1016/j.ahj.2021.09.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 09/07/2021] [Indexed: 12/11/2022]

Pang M, Platt RW, Schuster T, Abrahamowicz M. Flexible extension of the accelerated failure time model to account for nonlinear and time-dependent effects of covariates on the hazard. Stat Methods Med Res 2021;30:2526-2542. [PMID: 34547928 PMCID: PMC8649433 DOI: 10.1177/09622802211041759] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Finger RP, Daien V, Talks JS, Mitchell P, Wong TY, Sakamoto T, Eldem BM, Lövestam‐Adrian M, Korobelnik J. A novel tool to assess the quality of RWE to guide the management of retinal disease. Acta Ophthalmol 2021;99:604-610. [PMID: 33369881 DOI: 10.1111/aos.14698] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 11/02/2020] [Accepted: 11/03/2020] [Indexed: 12/27/2022]

Dunne J, Tessema GA, Ognjenovic M, Pereira G. Quantifying the influence of bias in reproductive and perinatal epidemiology through simulation. Ann Epidemiol 2021;63:86-101. [PMID: 34384883 DOI: 10.1016/j.annepidem.2021.07.033] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 07/20/2021] [Accepted: 07/31/2021] [Indexed: 11/25/2022]

1,3-Butadiene, styrene and selected outcomes among synthetic rubber polymer workers: Updated exposure-response analyses. Chem Biol Interact 2021;347:109600. [PMID: 34324853 DOI: 10.1016/j.cbi.2021.109600] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 07/14/2021] [Accepted: 07/22/2021] [Indexed: 11/23/2022]

Abstract

OBJECTIVE

- To evaluate exposure-response relationships between 1,3-butadiene and styrene and selected diseases among synthetic rubber polymer workers.

METHODS

- 21,087 workers (16,579 men; 4508 women) were followed from 1943 through 2009 to determine mortality outcomes. Cox regression models estimated rate ratios (RRs) and 95% confidence intervals (CIs) by quartile of cumulative exposure to butadiene or styrene and exposure-response trends for cancers of the bladder, lung, kidney, esophagus and pancreas, and for all nonmalignant respiratory disease (NMRD), chronic obstructive pulmonary disease (COPD) and pneumonia.

RESULTS

- Bladder cancer RRs were 2.13 (95% CI = 1.03 to 4.41) and 1.64 (95% CI = 0.76 to 3.54) in the highest quartiles of cumulative exposure to butadiene and styrene, respectively, and exposure-response trends were positive for both monomers (butadiene, trend p = 0.001; styrene, trend p = 0.004). Further analyses indicated that the exposure-response effect of each monomer on bladder cancer was demonstrated clearly only in the subgroup with high cumulative exposure (at or above the median) to the other monomer. Lung cancer was not associated with either monomer among men. Among women, lung cancer RRs were above 1.0 in each quartile of cumulative exposure to each monomer, but exposure-response was not seen for either monomer. Male workers had COPD RRs slightly above 1.0 in each quartile of cumulative exposure to each monomer, but there was no evidence of exposure-response among the exposed. Monomer exposure was not consistently associated with COPD in women or with the other cancer outcomes.

CONCLUSIONS

- This study found a positive exposure-response relationship between monomer exposures and bladder cancer. The independent effects of butadiene and styrene on this cancer could not be delineated. In some analyses, monomer exposure was associated with lung cancer in women and with COPD in men, but inconsistent exposure-response trends and divergent results by sex do not support a causal interpretation of the isolated positive associations.

Collapse

Buchka S, Hapfelmeier A, Gardner PP, Wilson R, Boulesteix AL. On the optimistic performance evaluation of newly introduced bioinformatic methods. Genome Biol 2021;22:152. [PMID: 33975646 PMCID: PMC8111726 DOI: 10.1186/s13059-021-02365-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2020] [Accepted: 04/23/2021] [Indexed: 12/03/2022] Open

Hoffmann S, Schönbrodt F, Elsas R, Wilson R, Strasser U, Boulesteix AL. The multiplicity of analysis strategies jeopardizes replicability: lessons learned across disciplines. ROYAL SOCIETY OPEN SCIENCE 2021;8:201925. [PMID: 33996122 PMCID: PMC8059606 DOI: 10.1098/rsos.201925] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Accepted: 03/22/2021] [Indexed: 05/05/2023]

Schmidt CO, Struckmann S, Enzenbach C, Reineke A, Stausberg J, Damerow S, Huebner M, Schmidt B, Sauerbrei W, Richter A. Facilitating harmonized data quality assessments. A data quality framework for observational health research data collections with software implementations in R. BMC Med Res Methodol 2021;21:63. [PMID: 33810787 PMCID: PMC8019177 DOI: 10.1186/s12874-021-01252-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 03/12/2021] [Indexed: 12/21/2022] Open

Sauerbrei W, Bland M, Evans SJW, Riley RD, Royston P, Schumacher M, Collins GS. Doug Altman: Driving critical appraisal and improvements in the quality of methodological and medical research. Biom J 2021;63:226-246. [PMID: 32639065 DOI: 10.1002/bimj.202000053] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 05/20/2020] [Accepted: 06/03/2020] [Indexed: 12/12/2022]

Kragh Andersen P, Pohar Perme M, van Houwelingen HC, Cook RJ, Joly P, Martinussen T, Taylor JMG, Abrahamowicz M, Therneau TM. Analysis of time-to-event for observational studies: Guidance to the use of intensity models. Stat Med 2021;40:185-211. [PMID: 33043497 DOI: 10.1002/sim.8757] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 09/04/2020] [Accepted: 09/04/2020] [Indexed: 12/15/2022]

Goetghebeur E, le Cessie S, De Stavola B, Moodie EEM, Waernbaum I, “on behalf of” the topic group Causal Inference (TG7) of the STRATOS initiative. Formulating causal questions and principled statistical answers. Stat Med 2020;39:4922-4948. [PMID: 32964526 PMCID: PMC7756489 DOI: 10.1002/sim.8741] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2019] [Revised: 05/10/2020] [Accepted: 08/05/2020] [Indexed: 12/13/2022]

Bach P, Wallisch C, Klein N, Hafermann L, Sauerbrei W, Steyerberg EW, Heinze G, Rauch G, for topic group 2 of the STRATOS initiative. Systematic review of education and practical guidance on regression modeling for medical researchers who lack a strong statistical background: Study protocol. PLoS One 2020;15:e0241427. [PMID: 33347441 PMCID: PMC7751867 DOI: 10.1371/journal.pone.0241427] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Accepted: 10/14/2020] [Indexed: 12/23/2022] Open

Abstract

In the last decades, statistical methodology has developed rapidly, in particular in the field of regression modeling. Multivariable regression models are applied in almost all medical research projects. Therefore, the potential impact of statistical misconceptions within this field can be enormous Indeed, the current theoretical statistical knowledge is not always adequately transferred to the current practice in medical statistics. Some medical journals have identified this problem and published isolated statistical articles and even whole series thereof. In this systematic review, we aim to assess the current level of education on regression modeling that is provided to medical researchers via series of statistical articles published in medical journals. The present manuscript is a protocol for a systematic review that aims to assess which aspects of regression modeling are covered by statistical series published in medical journals that intend to train and guide applied medical researchers with limited statistical knowledge. Statistical paper series cannot easily be summarized and identified by common keywords in an electronic search engine like Scopus. We therefore identified series by a systematic request to statistical experts who are part or related to the STRATOS Initiative (STRengthening Analytical Thinking for Observational Studies). Within each identified article, two raters will independently check the content of the articles with respect to a predefined list of key aspects related to regression modeling. The content analysis of the topic-relevant articles will be performed using a predefined report form to assess the content as objectively as possible. Any disputes will be resolved by a third reviewer. Summary analyses will identify potential methodological gaps and misconceptions that may have an important impact on the quality of analyses in medical research. This review will thus provide a basis for future guidance papers and tutorials in the field of regression modeling which will enable medical researchers 1) to interpret publications in a correct way, 2) to perform basic statistical analyses in a correct way and 3) to identify situations when the help of a statistical expert is required.

Collapse

Affiliation(s)

Paul Bach Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Institute of Biometry and Clinical Epidemiology, Charité - Universitätsmedizin Berlin, Berlin, Germany Berlin Institute of Health (BIH), Berlin, Germany School of Business and Economics, Applied Statistics, Humboldt-Universität zu Berlin, Berlin, Germany
Christine Wallisch Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Institute of Biometry and Clinical Epidemiology, Charité - Universitätsmedizin Berlin, Berlin, Germany Berlin Institute of Health (BIH), Berlin, Germany Section for Clinical Biometrics, Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, Vienna, Austria
Nadja Klein School of Business and Economics, Applied Statistics, Humboldt-Universität zu Berlin, Berlin, Germany
Lorena Hafermann Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Institute of Biometry and Clinical Epidemiology, Charité - Universitätsmedizin Berlin, Berlin, Germany Berlin Institute of Health (BIH), Berlin, Germany
Willi Sauerbrei Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center—University of Freiburg, Freiburg, Germany
Ewout W. Steyerberg Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
Georg Heinze Section for Clinical Biometrics, Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, Vienna, Austria
Geraldine Rauch Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Institute of Biometry and Clinical Epidemiology, Charité - Universitätsmedizin Berlin, Berlin, Germany Berlin Institute of Health (BIH), Berlin, Germany
for topic group 2 of the STRATOS initiative

Collapse

Boulesteix AL, Groenwold RH, Abrahamowicz M, Binder H, Briel M, Hornung R, Morris TP, Rahnenführer J, Sauerbrei W. Introduction to statistical simulations in health research. BMJ Open 2020;10:e039921. [PMID: 33318113 PMCID: PMC7737058 DOI: 10.1136/bmjopen-2020-039921] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open

Coomans MB, Peeters MC, Koekkoek JA, Schoones JW, Reijneveld J, Taphoorn MJ, Dirven L. Research Objectives, Statistical Analyses and Interpretation of Health-Related Quality of Life Data in Glioma Research: A Systematic Review. Cancers (Basel) 2020;12:E3502. [PMID: 33255505 PMCID: PMC7760401 DOI: 10.3390/cancers12123502] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 11/19/2020] [Accepted: 11/21/2020] [Indexed: 11/16/2022] Open

Dwivedi AK, Shukla R. Evidence-based statistical analysis and methods in biomedical research (SAMBR) checklists according to design features. Cancer Rep (Hoboken) 2020;3:e1211. [PMID: 32794640 PMCID: PMC7941456 DOI: 10.1002/cnr2.1211] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Revised: 06/11/2019] [Accepted: 07/16/2019] [Indexed: 12/12/2022] Open

Abstract

BACKGROUND

Statistical analysis according to design features and objectives is essential to ensure the validity and reliability of the study findings and conclusions in biomedical research. Heterogeneity in reporting study design elements and conducting statistical analyses is often observed for the same study design and study objective in medical literatures. Sometimes, researchers face a lot of predicaments using appropriate statistical approaches highlighted by methodologists for a specific study design either due to lack of accessibility or understanding of statistical methods or unavailability of checklists related to design and analysis in a concise format. The purpose of this review is to provide the checklist of statistical analysis and methods in biomedical research (SAMBR) to applied researchers.

RECENT FINDINGS

We initially identified the important steps of reporting design features that may influence the choice of statistical analysis in biomedical research and essential steps of data analysis of common studies. We subsequently searched for statistical approaches employed for each study design/study objective available in publications and other resources. Compilation of these steps produced SAMBR guidance document, which includes three parts. Applied researchers can use part (A) and part (B) of SAMBR to describe or evaluate research design features and quality of statistical analysis, respectively, in reviewing studies or designing protocols. Part (C) of SAMBR can be used to perform essential and preferred evidence-based data analysis specific to study design and objective.

CONCLUSIONS

We believe that the statistical methods checklists may improve reporting of research design, standardize methodological practices, and promote consistent application of statistical approaches, thus improving the quality of research studies. The checklists do not enforce the use of suggested statistical methods but rather highlight and encourage to conduct the best statistical practices. There is a need to develop an interactive web-based application of the checklists for users for its wide applications.

Collapse

Shaw PA, Gustafson P, Carroll RJ, Deffner V, Dodd KW, Keogh RH, Kipnis V, Tooze JA, Wallace MP, Küchenhoff H, Freedman LS. STRATOS guidance document on measurement error and misclassification of variables in observational epidemiology: Part 2-More complex methods of adjustment and advanced topics. Stat Med 2020;39:2232-2263. [PMID: 32246531 PMCID: PMC7272296 DOI: 10.1002/sim.8531] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2018] [Revised: 02/27/2020] [Accepted: 02/28/2020] [Indexed: 12/24/2022]

Sauerbrei W, Perperoglou A, Schmid M, Abrahamowicz M, Becher H, Binder H, Dunkler D, Harrell FE, Royston P, Heinze G. State of the art in selection of variables and functional forms in multivariable analysis-outstanding issues. Diagn Progn Res 2020;4:3. [PMID: 32266321 PMCID: PMC7114804 DOI: 10.1186/s41512-020-00074-3] [Citation(s) in RCA: 139] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 03/18/2020] [Indexed: 12/18/2022] Open

Abstract

BACKGROUND

How to select variables and identify functional forms for continuous variables is a key concern when creating a multivariable model. Ad hoc 'traditional' approaches to variable selection have been in use for at least 50 years. Similarly, methods for determining functional forms for continuous variables were first suggested many years ago. More recently, many alternative approaches to address these two challenges have been proposed, but knowledge of their properties and meaningful comparisons between them are scarce. To define a state of the art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge, many outstanding issues in multivariable modelling remain. Our main aims are to identify and illustrate such gaps in the literature and present them at a moderate technical level to the wide community of practitioners, researchers and students of statistics.

METHODS

We briefly discuss general issues in building descriptive regression models, strategies for variable selection, different ways of choosing functional forms for continuous variables and methods for combining the selection of variables and functions. We discuss two examples, taken from the medical literature, to illustrate problems in the practice of modelling.

RESULTS

Our overview revealed that there is not yet enough evidence on which to base recommendations for the selection of variables and functional forms in multivariable analysis. Such evidence may come from comparisons between alternative methods. In particular, we highlight seven important topics that require further investigation and make suggestions for the direction of further research.

CONCLUSIONS

Selection of variables and of functional forms are important topics in multivariable analysis. To define a state of the art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge, further comparative research is required.

Collapse

Wang Y, Beauchamp ME, Abrahamowicz M. Nonlinear and time-dependent effects of sparsely measured continuous time-varying covariates in time-to-event analysis. Biom J 2020;62:492-515. [PMID: 32022299 DOI: 10.1002/bimj.201900042] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Revised: 11/18/2019] [Accepted: 11/22/2019] [Indexed: 12/14/2022]

Arisido MW, Antolini L, Bernasconi DP, Valsecchi MG, Rebora P. Joint model robustness compared with the time-varying covariate Cox model to evaluate the association between a longitudinal marker and a time-to-event endpoint. BMC Med Res Methodol 2019;19:222. [PMID: 31795933 PMCID: PMC6888912 DOI: 10.1186/s12874-019-0873-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 11/20/2019] [Indexed: 12/21/2022] Open

Abstract

BACKGROUND

The recent progress in medical research generates an increasing interest in the use of longitudinal biomarkers for characterizing the occurrence of an outcome. The present work is motivated by a study, where the objective was to explore the potential of the long pentraxin 3 (PTX3) as a prognostic marker of Acute Graft-versus-Host Disease (GvHD) after haematopoietic stem cell transplantation. Time-varying covariate Cox model was commonly used, despite its limiting assumptions that marker values are constant in time and measured without error. A joint model has been developed as a viable alternative; however, the approach is computationally intensive and requires additional strong assumptions, in which the impacts of their misspecification were not sufficiently studied.

METHODS

We conduct an extensive simulation to clarify relevant assumptions for the understanding of joint models and assessment of its robustness under key model misspecifications. Further, we characterize the extent of bias introduced by the limiting assumptions of the time-varying covariate Cox model and compare its performance with a joint model in various contexts. We then present results of the two approaches to evaluate the potential of PTX3 as a prognostic marker of GvHD after haematopoietic stem cell transplantation.

RESULTS

Overall, we illustrate that a joint model provides an unbiased estimate of the association between a longitudinal marker and the hazard of an event in the presence of measurement error, showing improvement over the time-varying Cox model. However, a joint model is severely biased when the baseline hazard or the shape of the longitudinal trajectories are misspecified. Both the Cox model and the joint model correctly specified indicated PTX3 as a potential prognostic marker of GvHD, with the joint model providing a higher hazard ratio estimate.

CONCLUSIONS

Joint models are beneficial to investigate the capability of the longitudinal marker to characterize time-to-event endpoint. However, the benefits are strictly linked to the correct specification of the longitudinal marker trajectory and the baseline hazard function, indicating a careful consideration of assumptions to avoid biased estimates.

Collapse