1
|
Tomlinson E, Cooper C, Davenport C, Rutjes AWS, Leeflang M, Mallett S, Whiting P. Common challenges and suggestions for risk of bias tool development: a systematic review of methodological studies. J Clin Epidemiol 2024; 171:111370. [PMID: 38670243 DOI: 10.1016/j.jclinepi.2024.111370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 04/16/2024] [Accepted: 04/17/2024] [Indexed: 04/28/2024]
Abstract
OBJECTIVES To review the findings of studies that have evaluated the design and/or usability of key risk of bias (RoB) tools for the assessment of RoB in primary studies, as categorized by the Library of Assessment Tools and InsTruments Used to assess Data validity in Evidence Synthesis Network (a searchable library of RoB tools for evidence synthesis): Prediction model Risk Of Bias ASessment Tool (PROBAST) , Risk of Bias-2 (RoB2), Risk Of Bias In Non-randomised Studies of Interventions (ROBINS-I), Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2), Quality Assessment of Diagnostic Accuracy Studies-Comparative (QUADAS-C), Quality Assessment of Prognostic Accuracy Studies (QUAPAS), Risk Of Bias in Non-randomised Studies of Exposures (ROBINS-E), and the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) RoB checklist. STUDY DESIGN AND SETTING Systematic review of methodological studies. We conducted a forward citation search from the primary report of each tool, to identify primary studies that aimed to evaluate the design and/or usability of the tool. Two reviewers assessed studies for inclusion. We extracted tool features into Microsoft Word and used NVivo for document analysis, comprising a mix of deductive and inductive approaches. We summarized findings within each tool and explored common findings across tools. RESULTS We identified 13 tool evaluations meeting our inclusion criteria: PROBAST (3), RoB2 (3), ROBINS-I (4), and QUADAS-2 (3). We identified no evaluations for the other tools. Evaluations varied in clinical topic area, methodology, approach to bias assessment, and tool user background. Some had limitations affecting generalizability. We identified common findings across tools for 6/14 themes: (1) challenging items (eg, RoB2/ROBINS-I "deviations from intended interventions" domain), (2) overall RoB judgment (concerns with overall risk calculation in PROBAST/ROBINS-I), (3) tool usability (concerns about complexity), (4) time to complete tool (varying demands on time, eg, depending on number of outcomes assessed), (5) user agreement (varied across tools), and (6) recommendations for future use (eg, piloting) and development (add intermediate domain answer to QUADAS-2/PROBAST; provide clearer guidance for all tools). Of the other eight themes, seven only had findings for the QUADAS-2 tool, limiting comparison across tools, and one ("reorganization of questions") had no findings. CONCLUSION Evaluations of key RoB tools have posited common challenges and recommendations for tool use and development. These findings may be helpful to people who use or develop RoB tools. Guidance is necessary to support the design and implementation of future RoB tool evaluations.
Collapse
Affiliation(s)
- Eve Tomlinson
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.
| | - Chris Cooper
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Clare Davenport
- Test and Prediction Group, Institute of Applied Health Research, University of Birmingham, Birmingham, B15 2TT, UK; NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust and University of Birmingham, Birmingham B15 2TT, UK
| | - Anne W S Rutjes
- Department of Medical and Surgical Sciences for Children and Adults (SMECHIMAI), University of Modena and Reggio Emilia, Modena, Italy
| | - Mariska Leeflang
- Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | - Sue Mallett
- Centre for Medical Imaging, University College London, London, UK
| | - Penny Whiting
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| |
Collapse
|
2
|
Kuehn R, Wang Y, Guyatt G. Overly complex methods may impair pragmatic use of core evidence-based medicine principles. BMJ Evid Based Med 2024; 29:139-141. [PMID: 38453419 DOI: 10.1136/bmjebm-2024-112868] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/22/2024] [Indexed: 03/09/2024]
Affiliation(s)
- Rebecca Kuehn
- Department of Clinical Sciences, Liverpool School of Tropical Medicine, Liverpool, UK
| | - Ying Wang
- Department of Health Research Methods, Evidence, and Impact (HEI), McMaster Univ, Hamilton, Ontario, Canada
| | - Gordon Guyatt
- Department of Health Research Methods, Evidence, and Impact (HEI), McMaster Univ, Hamilton, Ontario, Canada
| |
Collapse
|
3
|
Konno K, Gibbons J, Lewis R, Pullin AS. Potential types of bias when estimating causal effects in environmental research and how to interpret them. ENVIRONMENTAL EVIDENCE 2024; 13:1. [PMID: 39294842 PMCID: PMC11376104 DOI: 10.1186/s13750-024-00324-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 02/01/2024] [Indexed: 09/21/2024]
Abstract
To inform environmental policy and practice, researchers estimate effects of interventions/exposures by conducting primary research (e.g., impact evaluations) or secondary research (e.g., evidence reviews). If these estimates are derived from poorly conducted/reported research, then they could misinform policy and practice by providing biased estimates. Many types of bias have been described, especially in health and medical sciences. We aimed to map all types of bias from the literature that are relevant to estimating causal effects in the environmental sector. All the types of bias were initially identified by using the Catalogue of Bias (catalogofbias.org) and reviewing key publications (n = 11) that previously collated and described biases. We identified 121 (out of 206) types of bias that were relevant to estimating causal effects in the environmental sector. We provide a general interpretation of every relevant type of bias covered by seven risk-of-bias domains for primary research: risk of confounding biases; risk of post-intervention/exposure selection biases; risk of misclassified/mismeasured comparison biases; risk of performance biases; risk of detection biases; risk of outcome reporting biases; risk of outcome assessment biases, and four domains for secondary research: risk of searching biases; risk of screening biases; risk of study appraisal and data coding/extraction biases; risk of data synthesis biases. Our collation should help scientists and decision makers in the environmental sector be better aware of the nature of bias in estimation of causal effects. Future research is needed to formalise the definitions of the collated types of bias such as through decomposition using mathematical formulae.
Collapse
Affiliation(s)
- Ko Konno
- School of Natural Sciences, Bangor University, Bangor, UK.
| | - James Gibbons
- School of Natural Sciences, Bangor University, Bangor, UK
| | - Ruth Lewis
- School of Medical and Health Sciences, Bangor University, Bangor, UK
| | | |
Collapse
|
4
|
Al Masri A, Schiffner U, Mourad MS, Schmoeckel J, Joseph P, Splieth CH. The impact of bias of underlying literature in guidelines on its recommendations: assessment of the German fluoride guideline. Eur Arch Paediatr Dent 2024; 25:65-73. [PMID: 38007707 PMCID: PMC10942900 DOI: 10.1007/s40368-023-00854-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 10/20/2023] [Indexed: 11/28/2023]
Abstract
PURPOSE The significance of the underlying literature in clinical guidelines can be weakened by the risk of bias, which could negatively affect the recommendations. Especially in controversial matters, such as fluoride use for caries prevention in children, biased results may be not reliable and lead to incorrect conclusions. This study was performed to detect bias in underlying literature of the German guideline for caries prevention using fluoride in children, where no consensus was reached between paediatricians and paediatric dentists. METHODS Three tools used for risk of bias assessments of different study designs were RoB 2 for RCTs, ROBINS-I for non-randomized studies, and ROBIS for systematic reviews. For each study cited in the guideline two independent risk of bias assessments were performed. Disagreements were resolved by consensus. RESULTS Out of 58 papers, 48.3% (n = 28) showed high risk of bias, with the majority in sections regarding fluoride tablets, fluoridated toothpaste, and paediatricians' recommendations. 9 out of 20 recommendations and statements were based on studies with high risk of bias, all of which were in these three controversial sections. 13 out of 29 RCTs showed high risk of bias (44.8%), as all 13 non-randomized trials did, while only 2 of 16 (12.5%) systematic reviews had high risk of bias. CONCLUSION Considering risk of bias of cited studies in clinical guidelines may result in substantial changes in its recommendations and aid in reaching consensus. Efforts should be made to assess risk of bias of underlying literature in future clinical guidelines.
Collapse
Affiliation(s)
- A Al Masri
- Department of Preventive and Pediatric Dentistry, Greifswald University Dental Clinics, Walther-Rathenau-Straße 42a, 17475, Greifswald, Germany.
| | - U Schiffner
- Department for Periodontology, Preventive and Restorative Dentistry, Center for Dental and Oral Medicine, University Medical Center Hamburg-Eppendorf (UKE), Martinistraße 52, 20246, Hamburg, Germany
| | - M S Mourad
- Department of Preventive and Pediatric Dentistry, Greifswald University Dental Clinics, Walther-Rathenau-Straße 42a, 17475, Greifswald, Germany
- Department of Orthodontics, Greifswald University Dental Clinics, Walther-Rathenau-Straße 42a, 17475, Greifswald, Germany
| | - J Schmoeckel
- Department of Preventive and Pediatric Dentistry, Greifswald University Dental Clinics, Walther-Rathenau-Straße 42a, 17475, Greifswald, Germany
| | - P Joseph
- Department of Preventive and Pediatric Dentistry, Greifswald University Dental Clinics, Walther-Rathenau-Straße 42a, 17475, Greifswald, Germany
| | - C H Splieth
- Department of Preventive and Pediatric Dentistry, Greifswald University Dental Clinics, Walther-Rathenau-Straße 42a, 17475, Greifswald, Germany
| |
Collapse
|
5
|
Gawlik A, Lüdemann J, Neuhausen A, Zepp C, Vitinius F, Kleinert J. A Systematic Review of Workplace Physical Activity Coaching. JOURNAL OF OCCUPATIONAL REHABILITATION 2023; 33:550-569. [PMID: 36849840 PMCID: PMC10495277 DOI: 10.1007/s10926-023-10093-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 01/09/2023] [Indexed: 06/18/2023]
Abstract
Aim Studies show that about 60 min of moderate physical activity (PA) per day compensate for sitting all day at work. However, the workplace offers an ideal setting for health-promoting interventions such as PA coaching as a person-centered intervention aimed at achieving lasting health behavior changes. Given a good evidence base of health coaching studies in general, this systematic review aims to provide an overview of workplace PA coaching interventions. Methods This review was conducted according to PRISMA guidelines. Studies published up to July 2021 were considered based on the following inclusion criteria: (1) longitudinal intervention studies, (2) analysis of PA at work, (3) sedentary employees, (4) PA coaching in the workplace as intervention, (5) increasing workplace PA. Results Of 4323 studies found, 14 studies with 17 interventions met inclusion criteria. All 17 interventions indicated an increase in at least one PA outcome. Twelve interventions indicated significant improvements in at least one workplace or total PA outcome. There is a high variation within the different coaching parameters, such as behavior change techniques and communication channels. The study quality showed a moderate to high risk of bias. Conclusions The majority of interventions provided evidence for the effectiveness of workplace PA coaching. Nevertheless, the results are inconclusive with regard to the variety of coaching parameters and thus no general statement can be made about the effectiveness of individual parameters. However, this variety of parameters also leads to a high degree of individualization of workplace PA coaching interventions to increase PA for different groups of employees and different types of workplaces.
Collapse
Affiliation(s)
- A Gawlik
- Department of Health and Social Psychology, Institute of Psychology, German Sport University Cologne, Cologne, Germany.
| | - J Lüdemann
- Department of Health and Social Psychology, Institute of Psychology, German Sport University Cologne, Cologne, Germany
| | - A Neuhausen
- Department of Psychosomatics and Psychotherapy, Faculty of Medicine, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - C Zepp
- Department of Health and Social Psychology, Institute of Psychology, German Sport University Cologne, Cologne, Germany
| | - F Vitinius
- Department of Psychosomatics and Psychotherapy, Faculty of Medicine, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - J Kleinert
- Department of Health and Social Psychology, Institute of Psychology, German Sport University Cologne, Cologne, Germany
| |
Collapse
|
6
|
Kaiser I, Pfahlberg AB, Mathes S, Uter W, Diehl K, Steeb T, Heppt MV, Gefeller O. Inter-Rater Agreement in Assessing Risk of Bias in Melanoma Prediction Studies Using the Prediction Model Risk of Bias Assessment Tool (PROBAST): Results from a Controlled Experiment on the Effect of Specific Rater Training. J Clin Med 2023; 12:jcm12051976. [PMID: 36902763 PMCID: PMC10003882 DOI: 10.3390/jcm12051976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 02/27/2023] [Accepted: 02/28/2023] [Indexed: 03/06/2023] Open
Abstract
Assessing the risk of bias (ROB) of studies is an important part of the conduct of systematic reviews and meta-analyses in clinical medicine. Among the many existing ROB tools, the Prediction Model Risk of Bias Assessment Tool (PROBAST) is a rather new instrument specifically designed to assess the ROB of prediction studies. In our study we analyzed the inter-rater reliability (IRR) of PROBAST and the effect of specialized training on the IRR. Six raters independently assessed the risk of bias (ROB) of all melanoma risk prediction studies published until 2021 (n = 42) using the PROBAST instrument. The raters evaluated the ROB of the first 20 studies without any guidance other than the published PROBAST literature. The remaining 22 studies were assessed after receiving customized training and guidance. Gwet's AC1 was used as the primary measure to quantify the pairwise and multi-rater IRR. Depending on the PROBAST domain, results before training showed a slight to moderate IRR (multi-rater AC1 ranging from 0.071 to 0.535). After training, the multi-rater AC1 ranged from 0.294 to 0.780 with a significant improvement for the overall ROB rating and two of the four domains. The largest net gain was achieved in the overall ROB rating (difference in multi-rater AC1: 0.405, 95%-CI 0.149-0.630). In conclusion, without targeted guidance, the IRR of PROBAST is low, questioning its use as an appropriate ROB instrument for prediction studies. Intensive training and guidance manuals with context-specific decision rules are needed to correctly apply and interpret the PROBAST instrument and to ensure consistency of ROB ratings.
Collapse
Affiliation(s)
- Isabelle Kaiser
- Department of Medical Informatics, Biometry and Epidemiology, Friedrich Alexander University of Erlangen-Nuremberg, 91054 Erlangen, Germany
- Correspondence:
| | - Annette B. Pfahlberg
- Department of Medical Informatics, Biometry and Epidemiology, Friedrich Alexander University of Erlangen-Nuremberg, 91054 Erlangen, Germany
| | - Sonja Mathes
- Department of Dermatology and Allergy Biederstein, Faculty of Medicine, Technical University of Munich, 80802 Munich, Germany
| | - Wolfgang Uter
- Department of Medical Informatics, Biometry and Epidemiology, Friedrich Alexander University of Erlangen-Nuremberg, 91054 Erlangen, Germany
| | - Katharina Diehl
- Department of Medical Informatics, Biometry and Epidemiology, Friedrich Alexander University of Erlangen-Nuremberg, 91054 Erlangen, Germany
| | - Theresa Steeb
- Department of Dermatology, University Hospital Erlangen, 91054 Erlangen, Germany
| | - Markus V. Heppt
- Department of Dermatology, University Hospital Erlangen, 91054 Erlangen, Germany
- Comprehensive Cancer Center Erlangen-European Metropolitan Area of Nuremberg (CCC ER-EMN), 91054 Erlangen, Germany
| | - Olaf Gefeller
- Department of Medical Informatics, Biometry and Epidemiology, Friedrich Alexander University of Erlangen-Nuremberg, 91054 Erlangen, Germany
| |
Collapse
|
7
|
Sargeant JM, Brennan ML, O'Connor AM. Levels of Evidence, Quality Assessment, and Risk of Bias: Evaluating the Internal Validity of Primary Research. Front Vet Sci 2022; 9:960957. [PMID: 35903128 PMCID: PMC9315339 DOI: 10.3389/fvets.2022.960957] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Accepted: 06/24/2022] [Indexed: 12/27/2022] Open
Abstract
Clinical decisions in human and veterinary medicine should be based on the best available evidence. The results of primary research are an important component of that evidence base. Regardless of whether assessing studies for clinical case management, developing clinical practice guidelines, or performing systematic reviews, evidence from primary research should be evaluated for internal validity i.e., whether the results are free from bias (reflect the truth). Three broad approaches to evaluating internal validity are available: evaluating the potential for bias in a body of literature based on the study designs employed (levels of evidence), evaluating whether key study design features associated with the potential for bias were employed (quality assessment), and applying a judgement as to whether design elements of a study were likely to result in biased results given the specific context of the study (risk of bias assessment). The level of evidence framework for assessing internal validity assumes that internal validity can be determined based on the study design alone, and thus makes the strongest assumptions. Risk of bias assessments involve an evaluation of the potential for bias in the context of a specific study, and thus involve the least assumptions about internal validity. Quality assessment sits somewhere between the assumptions of these two. Because risk of bias assessment involves the least assumptions, this approach should be used to assess internal validity where possible. However, risk of bias instruments are not available for all study designs, some clinical questions may be addressed using multiple study designs, and some instruments that include an evaluation of internal validity also include additional components (e.g., evaluation of comprehensiveness of reporting, assessments of feasibility or an evaluation of external validity). Therefore, it may be necessary to embed questions related to risk of bias within existing quality assessment instruments. In this article, we overview the approaches to evaluating internal validity, highlight the current complexities, and propose ideas for approaching assessments of internal validity.
Collapse
Affiliation(s)
- Jan M. Sargeant
- Department of Population Medicine, Ontario Veterinary College, University of Guelph, Guelph, ON, Canada
- *Correspondence: Jan M. Sargeant
| | - Marnie L. Brennan
- Centre for Evidence-Based Veterinary Medicine, School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington Campus, Loughborough, United Kingdom
| | - Annette M. O'Connor
- Department of Large Animal Clinical Sciences, College of Veterinary Medicine, Michigan State University, East Lansing, MI, United States
| |
Collapse
|
8
|
Luijken K, van de Wall BJM, Hooft L, Leenen LPH, Houwert RM, Groenwold RHH. How to assess applicability and methodological quality of comparative studies of operative interventions in orthopedic trauma surgery. Eur J Trauma Emerg Surg 2022; 48:4943-4953. [PMID: 35809102 DOI: 10.1007/s00068-022-02031-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 06/05/2022] [Indexed: 11/03/2022]
Abstract
PURPOSE It is challenging to generate and subsequently implement high-quality evidence in surgical practice. A first step would be to grade the strengths and weaknesses of surgical evidence and appraise risk of bias and applicability. Here, we described items that are common to different risk-of-bias tools. We explained how these could be used to assess comparative operative intervention studies in orthopedic trauma surgery, and how these relate to applicability of results. METHODS We extracted information from the Cochrane risk-of-bias-2 (RoB-2) tool, Risk Of Bias In Non-randomised Studies-of Interventions tool (ROBINS-I), and Methodological Index for Non-Randomized Studies (MINORS) criteria and derived a concisely formulated set of items with signaling questions tailored to operative interventions in orthopedic trauma surgery. RESULTS The established set contained nine items: population, intervention, comparator, outcome, confounding, missing data and selection bias, intervention status, outcome assessment, and pre-specification of analysis. Each item can be assessed using signaling questions and was explained using good practice examples of operative intervention studies in orthopedic trauma surgery. CONCLUSION The set of items will be useful to form a first judgment on studies, for example when including them in a systematic review. Existing risk of bias tools can be used for further evaluation of methodological quality. Additionally, the proposed set of items and signaling questions might be a helpful starting point for peer reviewers and clinical readers.
Collapse
Affiliation(s)
- Kim Luijken
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands.
| | - Bryan J M van de Wall
- Department of Orthopedic and Trauma Surgery, Cantonal Hospital of Lucerne, Lucerne, Switzerland.,Department of Health Sciences and Medicine, University of Lucerne, Lucerne, Switzerland
| | - Lotty Hooft
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, The Netherlands.,Cochrane Netherlands, University Medical Centre Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Luke P H Leenen
- Department of Trauma Surgery, University Medical Center Utrecht, Utrecht, The Netherlands
| | - R Marijn Houwert
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands.,Department of Trauma Surgery, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Rolf H H Groenwold
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands.,Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| | | |
Collapse
|
9
|
Igelström E, Campbell M, Craig P, Katikireddi SV. Cochrane's risk of bias tool for non-randomized studies (ROBINS-I) is frequently misapplied: A methodological systematic review. J Clin Epidemiol 2021; 140:22-32. [PMID: 34437948 PMCID: PMC8809341 DOI: 10.1016/j.jclinepi.2021.08.022] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 08/16/2021] [Accepted: 08/18/2021] [Indexed: 12/20/2022]
Abstract
OBJECTIVES We aimed to review how 'Risk of Bias In Non-randomized Studies-of Interventions' (ROBINS-I), a Cochrane risk of bias assessment tool, has been used in recent systematic reviews. STUDY DESIGN AND SETTING Database and citation searches were conducted in March 2020 to identify recently published reviews using ROBINS-I. Reported ROBINS-I assessments and data on how ROBINS-I was used were extracted from each review. Methodological quality of reviews was assessed using AMSTAR 2 ('A MeaSurement Tool to Assess systematic Reviews'). RESULTS Of 181 hits, 124 reviews were included. Risk of bias was serious/critical in 54% of assessments on average, most commonly due to confounding. Quality of reviews was mostly low, and modifications and incorrect use of ROBINS-I were common, with 20% reviews modifying the rating scale, 20% understating overall risk of bias, and 19% including critical-risk of bias studies in evidence synthesis. Poorly conducted reviews were more likely to report low/moderate risk of bias (predicted probability 57% [95% CI: 47-67] in critically low-quality reviews, 31% [19-46] in high/moderate-quality reviews). CONCLUSION Low-quality reviews frequently apply ROBINS-I incorrectly, and may thus inappropriately include or give too much weight to uncertain evidence. Readers should be aware that such problems can lead to incorrect conclusions in reviews.
Collapse
Affiliation(s)
- Erik Igelström
- MRC/CSO Social and Public Health Sciences Unit, University of Glasgow, Berkeley Square 99 Berkeley Street, Glasgow, G3 7HR.
| | - Mhairi Campbell
- MRC/CSO Social and Public Health Sciences Unit, University of Glasgow, Berkeley Square 99 Berkeley Street, Glasgow, G3 7HR
| | - Peter Craig
- MRC/CSO Social and Public Health Sciences Unit, University of Glasgow, Berkeley Square 99 Berkeley Street, Glasgow, G3 7HR
| | - Srinivasa Vittal Katikireddi
- MRC/CSO Social and Public Health Sciences Unit, University of Glasgow, Berkeley Square 99 Berkeley Street, Glasgow, G3 7HR
| |
Collapse
|
10
|
Kelly SE, Greene-Finestone LS, Yetley EA, Benkhedda K, Brooks SPJ, Wells GA, MacFarlane AJ. NUQUEST-NUtrition QUality Evaluation Strengthening Tools: development of tools for the evaluation of risk of bias in nutrition studies. Am J Clin Nutr 2021; 115:256-271. [PMID: 34605544 PMCID: PMC8755056 DOI: 10.1093/ajcn/nqab335] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Accepted: 09/29/2021] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Dietary exposure assessments are a critical issue in evaluating human nutrition studies; however, nutrition-specific criteria are not consistently included in existing bias assessment tools. OBJECTIVES Our objective was to develop a set of risk of bias (RoB) tools that integrated nutrition-specific criteria into validated generic assessment tools to address RoB issues, including those specific to dietary exposure assessment. METHODS The Nutrition QUality Evaluation Strengthening Tools (NUQUEST) development and validation process included 8 steps. The first steps identified 1) a development strategy; 2) generic assessment tools with demonstrated validity; and 3) nutrition-specific appraisal issues. This was followed by 4) generation of nutrition-specific items and 5) development of guidance to aid users of NUQUEST. The final steps used established ratings of selected studies and feedback from independent raters to 6) assess reliability and validity; 7) assess formatting and usability; and 8) finalize NUQUEST. RESULTS NUQUEST is based on the Scottish Intercollegiate Guidelines Network checklists for randomized controlled trials, cohort studies, and case-control studies. Using a purposive sample of 45 studies representing the 3 study designs, interrater reliability was high (Cohen's κ: 0.73; 95% CI: 0.52, 0.93) across all tools and at least moderate for individual tools (range: 0.57-1.00). The use of a worksheet improved usability and consistency of overall interrater agreement across all study designs (40% without worksheet, 80%-100% with worksheet). When compared to published ratings, NUQUEST ratings for evaluated studies demonstrated high concurrent validity (93% perfect or near-perfect agreement). Where there was disagreement, the nutrition-specific component was a contributing factor in discerning exposure methodological issues. CONCLUSIONS NUQUEST integrates nutrition-specific criteria with generic criteria from assessment tools with demonstrated reliability and validity. NUQUEST represents a consistent and transparent approach for evaluating RoB issues related to dietary exposure assessment commonly encountered in human nutrition studies.
Collapse
Affiliation(s)
- Shannon E Kelly
- Cardiovascular Research Methods Centre, University of Ottawa Heart Institute, Ottawa, Ontario, Canada
| | | | | | - Karima Benkhedda
- Bureau of Nutritional Sciences, Health Canada, Ottawa, Ontario, Canada
| | - Stephen P J Brooks
- Bureau of Nutritional Sciences, Health Canada, Ottawa, Ontario, Canada,Department of Biology, Carleton University, Ottawa, Ontario, Canada
| | - George A Wells
- Cardiovascular Research Methods Centre, University of Ottawa Heart Institute, Ottawa, Ontario, Canada
| | | |
Collapse
|
11
|
Easter C, Thompson JA, Eldridge S, Taljaard M, Hemming K. Cluster randomized trials of individual-level interventions were at high risk of bias. J Clin Epidemiol 2021; 138:49-59. [PMID: 34197941 PMCID: PMC8592576 DOI: 10.1016/j.jclinepi.2021.06.021] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 06/12/2021] [Accepted: 06/22/2021] [Indexed: 11/16/2022]
Abstract
OBJECTIVES To describe the prevalence of risks of bias in cluster-randomized trials of individual-level interventions, according to the Cochrane Risk of Bias tool. STUDY DESIGN AND SETTING Review undertaken in duplicate of a random sample of 40 primary reports of cluster-randomized trials of individual-level interventions. RESULTS The most common reported reasons for adopting cluster randomization were the need to avoid contamination (17, 42.5%) and practical considerations (14, 35%). Of the 40 trials all but one was assessed as being at risk of bias. A majority (27, 67.5%) were assessed as at risk due to the timing of identification and recruitment of participants; many (21, 52.5%) due to an apparent lack of adequate allocation concealment; and many due to selectively reported results (22, 55%), arising from a mixture of reasons including lack of documentation of primary outcome. Other risks mostly occurred infrequently. CONCLUSION Many cluster-randomized trials evaluating individual-level interventions appear to be at risk of bias, mostly due to identification and recruitment biases. We recommend that investigators carefully consider the need for cluster randomization; follow recommended procedures to mitigate risks of identification and recruitment bias; and adhere to good reporting practices including clear documentation of primary outcome and allocation concealment methods.
Collapse
Affiliation(s)
- Christina Easter
- Institute of Applied Health Research, University of Birmingham, Birmingham, UK
| | - Jennifer A Thompson
- Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK
| | - Sandra Eldridge
- Centre for Clinical Trials and Methodology, Queen Mary University of London, London
| | - Monica Taljaard
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Ontario, Canada; School of Epidemiology and Public Health, University of Ottawa, Ottawa, Canada
| | - Karla Hemming
- Institute of Applied Health Research, University of Birmingham, Birmingham, UK.
| |
Collapse
|
12
|
Stone JC, Gurunathan U, Aromataris E, Glass K, Tugwell P, Munn Z, Doi SAR. Bias Assessment in Outcomes Research: The Role of Relative Versus Absolute Approaches. VALUE IN HEALTH : THE JOURNAL OF THE INTERNATIONAL SOCIETY FOR PHARMACOECONOMICS AND OUTCOMES RESEARCH 2021; 24:1145-1149. [PMID: 34372980 DOI: 10.1016/j.jval.2021.02.011] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2020] [Revised: 02/03/2021] [Accepted: 02/15/2021] [Indexed: 06/13/2023]
Abstract
OBJECTIVES Bias assessment tools vary in content and detail, and the method used for assessment may produce different assessment results in a study if not carefully considered. Therefore, taking an approach to the assessment of studies that produces a similar result regardless of the tool used for assessment (tool independence) is important. METHODS A preexisting study that used 25 different quality scales was assessed to examine tool dependence of 2 common approaches to bias assessments-absolute value judgments (defined as the qualitative risk of bias judgment based on a threshold across studies) and relative ranks (defined as the relative probability toward bias of a study relative to the best assessed study). Agreement between each of the 25 scales and a composite scale (that includes all unique safeguards across all scales) was computed (using the intraclass correlation coefficient [ICC]; consistency). Tool dependence was considered present when the ICCs were inconsistent across the 25 scales for the same study. RESULTS We found that using relative ranks for tools with different numbers and types of items produced consistent results, with only small differences in the agreement for the various tools with the composite tool, whereas consistency (measured by the ICC) varied considerably when using absolute judgments. Inconsistency is problematic because it means that the assessment result is linked to the scale and not to the study. CONCLUSIONS Tool independence is an important attribute of a bias assessment tool. On the basis of this study, the use of relative ranks retains tool independence and therefore produces consistent ranks for the same study across tools.
Collapse
Affiliation(s)
- Jennifer C Stone
- Department of Health Services Research and Policy, Research School of Population Health, Australian National University, Canberra, ACT, Australia and Systematic Review Centre for Laboratory Animal Experimentation (SYRCLE), Department for Health Evidence, Radboud University Medical Center, Nijmegen, The Netherlands.
| | - Usha Gurunathan
- Department of Anaesthesia, The Prince Charles Hospital, Brisbane, Queensland, Australia
| | | | - Kathryn Glass
- National Centre for Epidemiology and Population Health, Research School of Population Health, Australian National University, Canberra, ACT, Australia
| | - Peter Tugwell
- Department of Medicine, University of Ottawa, Ottawa, Canada
| | - Zachary Munn
- JBI, The University of Adelaide, Adelaide, Australia
| | - Suhail A R Doi
- Department of Population Medicine, College of Medicine, QU Health, Qatar University, Doha, Qatar.
| |
Collapse
|
13
|
Jeyaraman MM, Robson RC, Copstein L, Al-Yousif N, Pollock M, Xia J, Balijepalli C, Hofer K, Mansour S, Fazeli MS, Ansari MT, Tricco AC, Rabbani R, Abou-Setta AM. Customized guidance/training improved the psychometric properties of methodologically rigorous risk of bias instruments for non-randomized studies. J Clin Epidemiol 2021; 136:157-167. [PMID: 33979663 DOI: 10.1016/j.jclinepi.2021.04.017] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 04/09/2021] [Accepted: 04/24/2021] [Indexed: 12/26/2022]
Abstract
OBJECTIVES To evaluate the impact of guidance and training on the inter-rater reliability (IRR), inter-consensus reliability (ICR) and evaluator burden of the Risk of Bias (RoB) in Non-randomized Studies (NRS) of Interventions (ROBINS-I) tool, and the RoB instrument for NRS of Exposures (ROB-NRSE). STUDY DESIGN AND SETTING In a before-and-after study, seven reviewers appraised the RoB using ROBINS-I (n = 44) and ROB-NRSE (n = 44), before and after guidance and training. We used Gwet's AC1 statistic to calculate IRR and ICR. RESULTS After guidance and training, the IRR and ICR of the overall bias domain of ROBINS-I and ROB-NRSE improved significantly; with many individual domains showing either a significant (IRR and ICR of ROB-NRSE; ICR of ROBINS-I), or nonsignificant improvement (IRR of ROBINS-I). Evaluator burden significantly decreased after guidance and training for ROBINS-I, whereas for ROB-NRSE there was a slight nonsignificant increase. CONCLUSION Overall, there was benefit for guidance and training for both tools. We highly recommend guidance and training to reviewers prior to RoB assessments and that future research investigate aspects of guidance and training that are most effective.
Collapse
Affiliation(s)
- Maya M Jeyaraman
- George & Fay Yee Center for Healthcare Innovation, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba. R3E 0T6, Canada; Department of Community Health Sciences, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba, R3E 0T6, Canada.
| | - Reid C Robson
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, 209 Victoria St, Toronto, Ontario, M5B 1T8, Canada
| | - Leslie Copstein
- George & Fay Yee Center for Healthcare Innovation, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba. R3E 0T6, Canada
| | - Nameer Al-Yousif
- George & Fay Yee Center for Healthcare Innovation, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba. R3E 0T6, Canada
| | - Michelle Pollock
- Institute of Health Economics, 1200-10405 Jasper Avenue, Edmonton, Alberta, T5J 3N4, Canada
| | - Jun Xia
- Division of Epidemiology and Public Health, School of Medicine, University of Nottingham Medical School, Nottingham, NG7 2UH, UK; Nottingham Ningbo GRADE Centre, The University of Nottingham Ningbo, 199 East Taikang Road, Ningbo, China
| | | | - Kimberly Hofer
- Evidinno Outcomes Research Inc., 1750 Davie Street, Suites 601 & 602, Vancouver, British Columbia, V6B 2Z4, Canada
| | - Samer Mansour
- Centre Hospitalier de l'Université de Montreal, 2900, boul. Édouard-Montpetit, Montréal (Québec) H3T 1J4, Canada; Faculty of Medicine, Department of Medicine, Université de Montréal, Roger-Gaudry Building, 2900 Edouard Montpetit Blvd, Montreal, Quebec H3T 1J4, Canada; Centre de recherche du Centre Hospitalier de l'Université de Montréal, 900 St Denis St, Montreal, Quebec H2 × 0A9, Canada
| | - Mir S Fazeli
- Evidinno Outcomes Research Inc., 1750 Davie Street, Suites 601 & 602, Vancouver, British Columbia, V6B 2Z4, Canada
| | - Mohammed T Ansari
- School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Room 101, 600 Peter Morand Crescent, Ottawa, Ontario, K1G 5Z3, Canada
| | - Andrea C Tricco
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, 209 Victoria St, Toronto, Ontario, M5B 1T8, Canada; Epidemiology Division & Institute of Health, Management, and Policy Evaluation, Dalla Lana School of Public Health, University of Toronto, 27 King's College Circle, Toronto, Ontario, M5S 1A1, Canada; Queen's Collaboration for Health Care Quality Joanna Briggs Institute Centre of Excellence, Queen's University, 92 Barrie Street, Room 214, Kingston, Ontario, K7L 3N6, Canada
| | - Rasheda Rabbani
- George & Fay Yee Center for Healthcare Innovation, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba. R3E 0T6, Canada; Department of Community Health Sciences, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba, R3E 0T6, Canada
| | - Ahmed M Abou-Setta
- George & Fay Yee Center for Healthcare Innovation, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba. R3E 0T6, Canada; Department of Community Health Sciences, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba, R3E 0T6, Canada
| |
Collapse
|
14
|
Franco JVA, Meza N. Authors should also report the support for judgment when applying AMSTAR 2. J Clin Epidemiol 2021; 138:240. [PMID: 33774139 DOI: 10.1016/j.jclinepi.2021.02.029] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 02/24/2021] [Indexed: 01/08/2023]
Affiliation(s)
| | - Nicolas Meza
- Interdisciplinary Centre for Health Studies (CIESAL), Universidad de Valparaíso, Cochrane Chile Associate Centre,Viña del Mar, Chile
| |
Collapse
|
15
|
Methodologically rigorous risk of bias tools for nonrandomized studies had low reliability and high evaluator burden. J Clin Epidemiol 2020; 128:140-147. [DOI: 10.1016/j.jclinepi.2020.09.033] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 09/09/2020] [Accepted: 09/22/2020] [Indexed: 12/22/2022]
|
16
|
Minozzi S, Cinquini M, Gianola S, Gonzalez-Lorenzo M, Banzi R. The revised Cochrane risk of bias tool for randomized trials (RoB 2) showed low interrater reliability and challenges in its application. J Clin Epidemiol 2020; 126:37-44. [DOI: 10.1016/j.jclinepi.2020.06.015] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Revised: 05/27/2020] [Accepted: 06/15/2020] [Indexed: 11/29/2022]
|
17
|
Jeyaraman MM, Rabbani R, Al-Yousif N, Robson RC, Copstein L, Xia J, Pollock M, Mansour S, Ansari MT, Tricco AC, Abou-Setta AM. Inter-rater reliability and concurrent validity of ROBINS-I: protocol for a cross-sectional study. Syst Rev 2020; 9:12. [PMID: 31931871 PMCID: PMC6958722 DOI: 10.1186/s13643-020-1271-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 01/06/2020] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND The Cochrane Bias Methods Group recently developed the "Risk of Bias (ROB) in Non-randomized Studies of Interventions" (ROBINS-I) tool to assess ROB for non-randomized studies of interventions (NRSI). It is important to establish consistency in its application and interpretation across review teams. In addition, it is important to understand if specialized training and guidance will improve the reliability of the results of the assessments. Therefore, the objective of this cross-sectional study is to establish the inter-rater reliability (IRR), inter-consensus reliability (ICR), and concurrent validity of ROBINS-I. Furthermore, as this is a relatively new tool, it is important to understand the barriers to using this tool (e.g., time to conduct assessments and reach consensus-evaluator burden). METHODS Reviewers from four participating centers will appraise the ROB of a sample of NRSI publications using the ROBINS-I tool in two stages. For IRR and ICR, two pairs of reviewers will assess the ROB for each NRSI publication. In the first stage, reviewers will assess the ROB without any formal guidance. In the second stage, reviewers will be provided customized training and guidance. At each stage, each pair of reviewers will resolve conflicts and arrive at a consensus. To calculate the IRR and ICR, we will use Gwet's AC1 statistic. For concurrent validity, reviewers will appraise a sample of NRSI publications using both the New-castle Ottawa Scale (NOS) and ROBINS-I. We will analyze the concordance between the two tools for similar domains and for the overall judgments using Kendall's tau coefficient. To measure the evaluator burden, we will assess the time taken to apply the ROBINS-I (without and with guidance), and the NOS. To assess the impact of customized training and guidance on the evaluator burden, we will use the generalized linear models. We will use Microsoft Excel and SAS 9.4 to manage and analyze study data, respectively. DISCUSSION The quality of evidence from systematic reviews that include NRS depends partly on the study-level ROB assessments. The findings of this study will contribute to an improved understanding of the ROBINS-I tool and how best to use it.
Collapse
Affiliation(s)
- Maya M Jeyaraman
- The George and Fay Yee Center for Healthcare Innovation, University of Manitoba, 753 McDermot Avenue, Winnipeg, MB, R3E 0T6, Canada. .,Department of Community Health Sciences, University of Manitoba, Winnipeg, Canada.
| | - Rasheda Rabbani
- The George and Fay Yee Center for Healthcare Innovation, University of Manitoba, 753 McDermot Avenue, Winnipeg, MB, R3E 0T6, Canada.,Department of Community Health Sciences, University of Manitoba, Winnipeg, Canada
| | - Nameer Al-Yousif
- The George and Fay Yee Center for Healthcare Innovation, University of Manitoba, 753 McDermot Avenue, Winnipeg, MB, R3E 0T6, Canada
| | - Reid C Robson
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, Canada
| | - Leslie Copstein
- The George and Fay Yee Center for Healthcare Innovation, University of Manitoba, 753 McDermot Avenue, Winnipeg, MB, R3E 0T6, Canada
| | - Jun Xia
- Nottingham Ningbo GRADE Centre, The University of Nottingham Ningbo, Ningbo, China
| | | | - Samer Mansour
- Centre Hospitalier de l'Université de Montreal, Quebec, Montreal, Canada.,Faculty of Medicine, Department of Medicine, Université de Montréal, Quebec, Montreal, Canada.,Centre de recherche du Centre Hospitalier de l'Université de Montréal, Quebec, Montreal, Canada
| | - Mohammed T Ansari
- School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Andrea C Tricco
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, Canada.,Epidemiology Division, Dalla Lana School of Public Health and Institute of Health, Management, and Policy Evaluation, University of Toronto, Toronto, Canada.,Queen's Collaboration for Health Care Quality Joanna Briggs Institute Centre of Excellence, Queen's University, Kingston, Ontario, Canada
| | - Ahmed M Abou-Setta
- The George and Fay Yee Center for Healthcare Innovation, University of Manitoba, 753 McDermot Avenue, Winnipeg, MB, R3E 0T6, Canada.,Department of Community Health Sciences, University of Manitoba, Winnipeg, Canada
| |
Collapse
|