101
|
Craig L, Sims R, Glasziou P, Thomas R. Women's experiences of a diagnosis of gestational diabetes mellitus: a systematic review. BMC Pregnancy Childbirth 2020; 20:76. [PMID: 32028931 PMCID: PMC7006162 DOI: 10.1186/s12884-020-2745-1] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 01/15/2020] [Indexed: 12/17/2022] Open
Abstract
Background Gestational diabetes mellitus (GDM) - a transitory form of diabetes induced by pregnancy - has potentially important short and long-term health consequences for both the mother and her baby. There is no globally agreed definition of GDM, but definition changes have increased the incidence in some countries in recent years, with some research suggesting minimal clinical improvement in outcomes. The aim of this qualitative systematic review was to identify the psychosocial experiences a diagnosis of GDM has on women during pregnancy and the postpartum period. Methods We searched CINAHL, EMBASE, MEDLINE and PsycINFO databases for studies that provided qualitative data on the psychosocial experiences of a diagnosis of GDM on women across any stage of pregnancy and/or the postpartum period. We appraised the methodological quality of the included studies using the Critical Appraisal Skills Programme Checklist for Qualitative Studies and used thematic analysis to synthesis the data. Results Of 840 studies identified, 41 studies of diverse populations met the selection criteria. The synthesis revealed eight key themes: initial psychological impact; communicating the diagnosis; knowledge of GDM; risk perception; management of GDM; burden of GDM; social support; and gaining control. The identified benefits of a GDM diagnosis were largely behavioural and included an opportunity to make healthy eating changes. The identified harms were emotional, financial and cultural. Women commented about the added responsibility (eating regimens, appointments), financial constraints (expensive food, medical bills) and conflicts with their cultural practices (alternative eating, lack of information about traditional food). Some women reported living in fear of risking the health of their baby and conducted extreme behaviours such as purging and starving themselves. Conclusion A diagnosis of GDM has wide reaching consequences that are common to a diverse group of women. Threshold cut-offs for blood glucose levels have been determined using the risk of physiological harms to mother and baby. It may also be advantageous to consider the harms and benefits from a psychosocial and a physiological perspective. This may avoid unnecessary burden to an already vulnerable population.
Collapse
|
102
|
Scott AM, Kolstoe S, Ploem MCC, Hammatt Z, Glasziou P. Exempting low-risk health and medical research from ethics reviews: comparing Australia, the United Kingdom, the United States and the Netherlands. Health Res Policy Syst 2020; 18:11. [PMID: 31992320 PMCID: PMC6986069 DOI: 10.1186/s12961-019-0520-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 12/18/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Disproportionate regulation of health and medical research contributes to research waste. Better understanding of exemptions of research from ethics review in different jurisdictions may help to guide modification of review processes and reduce research waste. Our aim was to identify examples of low-risk human health and medical research exempt from ethics reviews in Australia, the United Kingdom, the United States and the Netherlands. METHODS We examined documents providing national guidance on research ethics in each country, including those authored by the National Health and Medical Research Council (Australia), National Health Service (United Kingdom), the Office for Human Research Protections (United States) and the Central Committee on Research Involving Humans (the Netherlands). Examples and types of research projects exempt from ethics reviews were identified, and similar examples and types were grouped together. RESULTS Nine categories of research were exempt from ethics reviews across the four countries; these were existing data or specimen, questionnaire or survey, interview, post-marketing study, evaluation of public benefit or service programme, randomised controlled trials, research with staff in their professional role, audit and service evaluation, and other exemptions. Existing non-identifiable data and specimens were exempt in all countries. Four categories - evaluation of public benefit or service programme, randomised controlled trials, research with staff in their professional role, and audit and service evaluation - were exempted by one country each. The remaining categories were exempted by two or three countries. CONCLUSIONS Examples and types of research exempt from research ethics reviews varied considerably. Given the considerable costs and burdens on researchers and ethics committees, it would be worthwhile to develop and provide clearer guidance on exemptions, illustrated with examples, with transparent underpinning rationales.
Collapse
|
103
|
Clark J, Glasziou P, Del Mar C, Bannach-Brown A, Stehlik P, Scott AM. A full systematic review was completed in 2 weeks using automation tools: a case study. J Clin Epidemiol 2020; 121:81-90. [PMID: 32004673 DOI: 10.1016/j.jclinepi.2020.01.008] [Citation(s) in RCA: 173] [Impact Index Per Article: 43.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Revised: 12/17/2019] [Accepted: 01/18/2020] [Indexed: 01/04/2023]
Abstract
BACKGROUND AND OBJECTIVES Systematic reviews (SRs) are time and resource intensive, requiring approximately 1 year from protocol registration to submission for publication. Our aim was to describe the process, facilitators, and barriers to completing the first 2-week full SR. STUDY DESIGN AND SETTING We systematically reviewed evidence of the impact of increased fluid intake, on urinary tract infection (UTI) recurrence, in individuals at risk for UTIs. The review was conducted by experienced systematic reviewers with complementary skills (two researcher clinicians, an information specialist, and an epidemiologist), using Systematic Review Automation tools, and blocked off time for the duration of the project. The outcomes were time to complete the SR, time to complete individual SR tasks, facilitators and barriers to progress, and peer reviewer feedback on the SR manuscript. Times to completion were analyzed quantitatively (minutes and calendar days); facilitators and barriers were mapped onto the Theoretical Domains Framework; and peer reviewer feedback was analyzed quantitatively and narratively. RESULTS The SR was completed in 61 person-hours (9 workdays; 12 calendar days); accepted version of the manuscript required 71 person-hours. Individual SR tasks ranged from 16 person-minutes (deduplication of search results) to 461 person-minutes (data extraction). The least time-consuming SR tasks were obtaining full-texts, searches, citation analysis, data synthesis, and deduplication. The most time-consuming tasks were data extraction, write-up, abstract screening, full-text screening, and risk of bias. Facilitators and barriers mapped onto the following domains: knowledge; skills; memory, attention, and decision process; environmental context and resources; and technology and infrastructure. Two sets of peer reviewer feedback were received on the manuscript: the first included 34 comments requesting changes, 17 changes were made, requiring 173 person-minutes; the second requested 13 changes, and eight were made, requiring 121 person-minutes. CONCLUSION A small and experienced systematic reviewer team using Systematic Review Automation tools who have protected time to focus solely on the SR can complete a moderately sized SR in 2 weeks.
Collapse
|
104
|
Sanders S, Thomas R, Glasziou P, Doust J. A review of changes to the attention deficit/hyperactivity disorder age of onset criterion using the checklist for modifying disease definitions. BMC Psychiatry 2019; 19:357. [PMID: 31718626 PMCID: PMC6849294 DOI: 10.1186/s12888-019-2337-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 10/23/2019] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Widening definitions of health conditions have the potential to affect millions of people and should only occur when there is strong evidence of benefit. In the last version of the Diagnostic and Statistical Manual of Mental Disorders (DSM), the DSM-5 Committee changed the Attention Deficit Hyperactivity Disorder (ADHD) age of onset criterion in two ways: raising the age of symptom onset and removing the requirement for symptoms to cause impairment. Given concerns about ADHD prevalence and treatment rates, we aimed to evaluate the evidence available to support these changes using a recently developed Checklist for Modifying Disease Definitions. METHODS We identified and analysed research informing changes to the DSM-IV-TR ADHD age of onset criterion. We compared this evidence to the evidence recommended in the Checklist for Modifying Disease Definitions. RESULTS The changes to the DSM-IV-TR age of onset criterion were based on a literature review (publicly available as a 2 page document with online table of included studies), which we appraised as at high risk of bias. Estimates of the change in ADHD prevalence resulting from change to the age of onset criterion were based on a single study that included only a small number of children with ADHD (n = 68) and only assessed the impact of change to the age component of the criterion. No evidence was used by, or available to the Committee regarding the impact on prevalence of removal of the requirement for impairment, or the effect of the criterion changes on diagnostic precision, the prognosis of, or the potential benefits or harms for individuals diagnosed by the new, but not old criterion. CONCLUSIONS The changes to the age of onset criterion were based on minimal research evidence that suffered from either high risk of bias or poor applicability. The minimal documentation available makes it difficult to judge the rigor of the process behind the criterion changes. Use of the Checklist for Modifying Disease Definitions would assist future proposed modifications of the DSM ADHD criteria, provide guidance on the studies needed to inform potential changes and would improve the transparency and documentation of the process.
Collapse
|
105
|
Scott AM, Bakhit M, Clark J, Vermeulen M, Jones M, Looke D, Del Mar C, Glasziou P. Australian state influenza notifications and school holiday closures in 2019. F1000Res 2019; 8:1899. [PMID: 33976871 PMCID: PMC8097737 DOI: 10.12688/f1000research.21145.3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/12/2021] [Indexed: 11/20/2022] Open
Abstract
Background: The impact of school holidays on influenza rates has been sparsely documented in Australia. In 2019, the early winter influenza season coincided with mid-year school breaks, enabling us the unusual opportunity to examine how influenza incidence changed during school holiday closure dates. Methods: The weekly influenza data from five Australian state and one territory health departments for the period of week 19 (mid-May) to week 39 (early October) 2019 were compared to each state's public-school holiday closure dates. We used segmented regression to model the weekly counts and a negative binomial distribution to account for overdispersion due to autocorrelation. The models' goodness-of-fit was assessed by plots of observed versus expected counts, plots of residuals versus predicted values, and Pearson's Chi-square test. The main exposure was the July two-week school holiday period, using a lag of one week. The effect is estimated as a percent change in incidence level, and in slope. Results: School holidays were associated with significant declines in influenza incidence in three states and one territory by between 41% and 65%. Two states did not show evidence of declines although one of those states had already passed its peak by the time of the school holidays. The models showed acceptable goodness-of-fit. The first decline during school holidays is seen in the school aged (5-19 years) population, with the declines in the adult and infant populations being smaller and following a week later. Conclusions: Given the significant and rapid reductions in incidence, these results have important public health implications. Closure or extension of holiday periods could be an emergency option for state governments.
Collapse
|
106
|
Scott AM, Stehlik P, Clark J, Zhang D, Yang Z, Hoffmann T, Mar CD, Glasziou P. Blue-Light Therapy for Acne Vulgaris: A Systematic Review and Meta-Analysis. Ann Fam Med 2019; 17:545-553. [PMID: 31712293 PMCID: PMC6846280 DOI: 10.1370/afm.2445] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 03/21/2019] [Accepted: 04/11/2019] [Indexed: 01/28/2023] Open
Abstract
PURPOSE Antibiotic use in acne treatment raises concerns about increased resistance, necessitating alternatives. We assessed the effectiveness of blue-light therapy for acne. METHODS We analyzed randomized controlled trials comparing blue light with nonlight interventions. Studies included people of any age, sex, and acne severity, in any setting, and reported on investigator-assessed change in acne severity, patients' assessment of improvement, change in inflammatory or noninflammatory lesions, and adverse events. Where data were sufficient, mean differences were calculated. RESULTS Eighteen references (14 trials) including 698 participants were included. Most of the trials were small and short (<12 weeks) and had high risk of bias. Investigator-assessed improvement was quantitatively reported in 5 trials, of which 3 reported significantly greater improvement in blue light than comparator, and 2 reported improvement. Patients' assessments of improvement were quantitatively reported by 2 trials, favoring blue light. Mean difference in the mean number of noninflammatory lesions was nonsignificant between groups at weeks 4, 8, and 10-12 and overall (mean difference [MD] = 3.47; 95% CI, -0.76 to 7.71; P = 0.11). Mean difference in the mean number of inflammatory lesions was likewise nonsignificant between groups at any of the time points and overall (MD = 0.16; 95% CI, -0.99 to 1.31; P = 0.78). Adverse events were generally mild and favored blue light or did not significantly differ between groups. CONCLUSION Methodological and reporting limitations of existing evidence limit conclusions about the effectiveness of blue light for acne. Clinicians and patients should therefore consider the balance between its benefits and adverse events, as well as costs.
Collapse
|
107
|
Gillies K, Chalmers I, Glasziou P, Elbourne D, Elliott J, Treweek S. Reducing research waste by promoting informed responses to invitations to participate in clinical trials. Trials 2019; 20:613. [PMID: 31661029 PMCID: PMC6819580 DOI: 10.1186/s13063-019-3704-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Accepted: 09/05/2019] [Indexed: 11/10/2022] Open
Abstract
Poor recruitment to, and retention in, clinical trials is a source of research waste that could be reduced by more informed choices about participation. Barriers to effective recruitment and retention can be wide-ranging but relevance of the questions being addressed by trials and the outcomes that they are assessing are key for potential participants. Decisions about trial participation should be informed by general and trial-specific information and by considering broader assessments of 'informedness' and how they impact on both recruitment and retention. We suggest that more informed decisions about trial participation should encourage personally appropriate decisions, increase recruitment and retention, and reduce research waste and increase its value.
Collapse
|
108
|
Moynihan R, Brodersen J, Heath I, Johansson M, Kuehlein T, Minué-Lorenzo S, Petursson H, Pizzanelli M, Reventlow S, Sigurdsson J, Stavdal A, Treadwell J, Glasziou P. Reforming disease definitions: a new primary care led, people-centred approach. BMJ Evid Based Med 2019; 24:170-173. [PMID: 30962252 DOI: 10.1136/bmjebm-2018-111148] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/22/2019] [Indexed: 12/11/2022]
|
109
|
Cleo G, Beller E, Glasziou P, Isenring E, Thomas R. Efficacy of habit-based weight loss interventions: a systematic review and meta-analysis. J Behav Med 2019; 43:519-532. [DOI: 10.1007/s10865-019-00100-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Accepted: 09/04/2019] [Indexed: 10/26/2022]
|
110
|
Razavi M, Glasziou P, Klocksieben FA, Ioannidis JPA, Chalmers I, Djulbegovic B. US Food and Drug Administration Approvals of Drugs and Devices Based on Nonrandomized Clinical Trials: A Systematic Review and Meta-analysis. JAMA Netw Open 2019; 2:e1911111. [PMID: 31509209 PMCID: PMC6739727 DOI: 10.1001/jamanetworkopen.2019.11111] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
IMPORTANCE The size of estimated treatment effects on the basis of which the US Food and Drug Administration (FDA) has approved drugs and devices with data from nonrandomized clinical trials (non-RCTs) remains unknown. OBJECTIVES To determine how often the FDA has authorized novel interventions based on non-RCTs and to assess whether there is an association of the magnitude of treatment effects with FDA requirements for additional testing in randomized clinical trials (RCTs). DATA SOURCES Overall, 606 drug applications for the Breakthrough Therapy designation from its inception in January 2012 were downloaded from the FDA website in January 2017 and August 2018, and 71 medical device applications for the Humanitarian Device Exemption from its inception in June 1996 were downloaded in August 2017. STUDY SELECTION Approved applications based on non-RCTs were included; RCTs, studies with insufficient information, duplicates, and safety data were excluded. DATA EXTRACTION AND SYNTHESIS Data were extracted by 2 independent investigators. A statistical association of the magnitude of estimated effect (expressed as an odds ratio) with FDA requests for RCTs was assessed. The data were also meta-analyzed to evaluate the differences in odds ratios between applications that required further testing and those that did not. The results are reported according to Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines. MAIN OUTCOMES AND MEASURES Disease, laboratory, and patient-related outcomes, including disease response or patient survival, were considered. RESULTS Among 677 drug and medical device applications, 68 (10.0%) were approved by the FDA based on non-RCTs. Estimates of effects were larger when no further RCTs were required (mean natural logarithm of the odds ratios, 2.18 vs 1.12; odds ratios, 8.85 vs 3.06; P = .03). The meta-analysis results confirmed these findings: estimated effects were approximately 2.5-fold higher for treatments or devices that were approved based on non-RCTs than for treatments or devices for which further testing in RCTs was required (6.30 [95% CI, 4.38-9.06] vs 2.46 [95% CI, 1.70-3.56]; P < .001). Overall, 9 of 677 total applications (1.3%) that were approved on the basis of non-RCTs had relative risks of 10 or greater and 12 (1.7%) had relative risks of 5 or greater. No clear threshold above which the FDA approved interventions based on the magnitude of estimated effect alone was detected. CONCLUSIONS AND RELEVANCE In this study, estimated magnitudes of effect were larger among studies for which the FDA did not require RCTs compared with studies for which it did. There was no clear threshold of treatment effect above which no RCTs were requested.
Collapse
|
111
|
Djulbegovic B, Glasziou P, Chalmers I. The importance of randomised vs non-randomised trials. Lancet 2019; 394:634-635. [PMID: 31448731 DOI: 10.1016/s0140-6736(19)31128-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/23/2019] [Accepted: 04/12/2019] [Indexed: 11/16/2022]
|
112
|
Elessi K, Albarqouni L, Glasziou P, Chalmers I. Promoting critical appraisal skills. Lancet 2019; 393:2589-2590. [PMID: 31258125 DOI: 10.1016/s0140-6736(19)30346-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Accepted: 02/06/2019] [Indexed: 11/30/2022]
|
113
|
Thomas ET, Guppy M, Straus SE, Bell KJL, Glasziou P. Rate of normal lung function decline in ageing adults: a systematic review of prospective cohort studies. BMJ Open 2019; 9:e028150. [PMID: 31248928 PMCID: PMC6597635 DOI: 10.1136/bmjopen-2018-028150] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Revised: 03/01/2019] [Accepted: 05/14/2019] [Indexed: 11/30/2022] Open
Abstract
OBJECTIVE To conduct a systematic review investigating the normal age-related changes in lung function in adults without known lung disease. DESIGN Systematic review. DATA SOURCES MEDLINE, Embase and Cumulative Index to Nursing and Allied Health Literature (CINAHL) were searched for eligible studies from inception to February 12, 2019, supplemented by manual searches of reference lists and clinical trial registries. ELIGIBILITY CRITERIA We planned to include prospective cohort studies and randomised controlled trials (control arms) that measured changes in lung function over time in asymptomatic adults without known respiratory disease. DATA EXTRACTION AND SYNTHESIS Two authors independently determined the eligibility of studies, extracted data and assessed the risk of bias of included studies using the modified Newcastle-Ottawa Scale. RESULTS From 4385 records screened, we identified 16 cohort studies with 31 099 participants. All included studies demonstrated decline in lung function-forced expiratory volume in 1 s (FEV1), forced vital capacity (FVC) and peak expiratory flow rate (PEFR) with age. In studies with longer follow-up (>10 years), rates of FEV1 decline ranged from 17.7 to 46.4 mL/year (median 22.4 mL/year). Overall, men had faster absolute rates of decline (median 43.5 mL/year) compared with women (median 30.5 mL/year). Differences in relative FEV1 change, however, were not observed between men and women. FEV1/FVC change was reported in only one study, declining by 0.29% per year. An age-specific analysis suggested the rate of FEV1 function decline may accelerate with each decade of age. CONCLUSIONS Lung function-FEV1, FVC and PEFR-decline with age in individuals without known lung disease. The definition of chronic airway disease may need to be reconsidered to allow for normal ageing and ensure that people likely to benefit from interventions are identified rather than healthy people who may be harmed by potential overdiagnosis and overtreatment. The first step would be to apply age, sex and ethnicity-adjusted FEV1/FVC thresholds to the disease definition of chronic obstructive pulmonary disease. PROSPERO REGISTRATION NUMBER CRD42018087066.
Collapse
|
114
|
O’Connor AM, Tsafnat G, Thomas J, Glasziou P, Gilbert SB, Hutton B. A question of trust: can we build an evidence base to gain trust in systematic review automation technologies? Syst Rev 2019; 8:143. [PMID: 31215463 PMCID: PMC6582554 DOI: 10.1186/s13643-019-1062-0] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 06/05/2019] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Although many aspects of systematic reviews use computational tools, systematic reviewers have been reluctant to adopt machine learning tools. DISCUSSION We discuss that the potential reason for the slow adoption of machine learning tools into systematic reviews is multifactorial. We focus on the current absence of trust in automation and set-up challenges as major barriers to adoption. It is important that reviews produced using automation tools are considered non-inferior or superior to current practice. However, this standard will likely not be sufficient to lead to widespread adoption. As with many technologies, it is important that reviewers see "others" in the review community using automation tools. Adoption will also be slow if the automation tools are not compatible with workflows and tasks currently used to produce reviews. Many automation tools being developed for systematic reviews mimic classification problems. Therefore, the evidence that these automation tools are non-inferior or superior can be presented using methods similar to diagnostic test evaluations, i.e., precision and recall compared to a human reviewer. However, the assessment of automation tools does present unique challenges for investigators and systematic reviewers, including the need to clarify which metrics are of interest to the systematic review community and the unique documentation challenges for reproducible software experiments. CONCLUSION We discuss adoption barriers with the goal of providing tool developers with guidance as to how to design and report such evaluations and for end users to assess their validity. Further, we discuss approaches to formatting and announcing publicly available datasets suitable for assessment of automation technologies and tools. Making these resources available will increase trust that tools are non-inferior or superior to current practice. Finally, we identify that, even with evidence that automation tools are non-inferior or superior to current practice, substantial set-up challenges remain for main stream integration of automation into the systematic review process.
Collapse
|
115
|
Byambasuren O, Beller E, Glasziou P. Current Knowledge and Adoption of Mobile Health Apps Among Australian General Practitioners: Survey Study. JMIR Mhealth Uhealth 2019; 7:e13199. [PMID: 31199343 PMCID: PMC6592476 DOI: 10.2196/13199] [Citation(s) in RCA: 85] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 04/30/2019] [Accepted: 05/11/2019] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Mobile health (mHealth) apps can be prescribed as an effective self-management tool for patients. However, it is challenging for doctors to navigate 350,000 mHealth apps to find the right ones to recommend. Although medical professionals from many countries are using mHealth apps to varying degrees, current mHealth app use by Australian general practitioners (GPs) and the barriers and facilitators they encounter when integrating mHealth apps in their clinical practice have not been reported comprehensively. OBJECTIVE The objectives of this study were to (1) evaluate current knowledge and use of mHealth apps by GPs in Australia, (2) determine the barriers and facilitators to their use of mHealth apps in consultations, and (3) explore potential solutions to the barriers. METHODS We helped the Royal Australian College of General Practitioners (RACGP) to expand the mHealth section of their annual technology survey for 2017 based on the findings of our semistructured interviews with GPs to further explore barriers to using mHealth apps in clinical practice. The survey was distributed to the RACGP members nationwide between October 26 and December 3, 2017 using Qualtrics Web-based survey tool. RESULTS A total of 1014 RACGP members responded (response rate 4.6% [1014/21,884], completion rate 61.2% [621/1014]). The median years practiced was 20.7 years. Two-thirds of the GPs used apps professionally in the forms of medical calculators and point-of-care references. A little over half of the GPs recommended apps for patients either daily (12.9%, 80/621), weekly (25.9%, 161/621), or monthly (13.4%, 83/621). Mindfulness and mental health apps were recommended most often (32.5%, 337/1036), followed by diet and nutrition (13.9%, 144/1036), exercise and fitness (12.7%, 132/1036), and women's health (10%, 104/1036) related apps. Knowledge and usage of evidence-based apps from the Handbook of Non-Drug Interventions were low. The prevailing barriers to app prescription were the lack of knowledge of effective apps (59.9%, 372/621) and the lack of trustworthy source to access them (15.5%, 96/621). GPs expressed their need for a list of safe and effective apps from a trustworthy source, such as the RACGP, to overcome these barriers. They reported a preference for online video training material or webinar to learn more about mHealth apps. CONCLUSIONS Most GPs are using apps professionally but recommending apps to patients sparingly. The main barriers to app prescription were the lack of knowledge of effective apps and the lack of trustworthy source to access them. A curated compilation of effective mHealth apps or an app library specifically aimed at GPs and health professionals would help solve both barriers.
Collapse
|
116
|
Thomas ET, Glasziou P, Dobler CC. Use of the terms "overdiagnosis" and "misdiagnosis" in the COPD literature: a rapid review. Breathe (Sheff) 2019; 15:e8-e19. [PMID: 31031840 PMCID: PMC6481986 DOI: 10.1183/20734735.0354-2018] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Challenges in the diagnostic process of chronic obstructive pulmonary disease (COPD) can result in diagnostic misclassifications, including overdiagnosis. The term "overdiagnosis" in general has been associated with variable definitions. In connection with efforts to reduce low-value care, "overdiagnosis" has been defined as a true positive diagnosis of a condition that is not associated with any harm in the diagnosed person. It is, however, unclear how the term "overdiagnosis" is used in the COPD literature. We conducted a rapid review of the literature to explore how the terms "overdiagnosis" and "misdiagnosis" are used in the context of COPD. Electronic searches of Medline were conducted from inception to October 2018, to identify primary studies that reported on over- and/or misdiagnosis of COPD using these terms. 28 articles were included in this review. Overdiagnosis and misdiagnosis in COPD were found to be used to describe five main concepts: 1) physician COPD diagnosis despite normal spirometry (14 studies); 2) discordant results for COPD diagnosis based on different spirometry-based definitions for airflow obstruction (10 studies); 3) COPD diagnosis based on pre-bronchodilator spirometry results (three studies); 4) comorbidities (e.g. heart failure or asthma) that affect spirometry and have clinical features which overlap with COPD (two studies); and 5) normalisation of abnormal (post-bronchodilator) spirometry at follow-up (one study). The terms "overdiagnosis" and "misdiagnosis" were often used interchangeably and almost always referred to a false positive diagnosis. Performing (technically correct) spirometry with correct interpretation of the results could probably reduce misdiagnosis in a large proportion of the misdiagnosed cases of COPD. In addition, guidelines need to provide a more acceptable consensus spirometric definition of airflow obstruction. Key points In the COPD literature, the terms "overdiagnosis" and "misdiagnosis" are often used interchangeably and almost always refer to a false positive diagnosis.Use of spirometry with correct interpretation of the results can avoid a substantial proportion of cases of misdiagnosis of COPD. Educational aims To explore the use of the terms "overdiagnosis" and "misdiagnosis" in the COPD literature.To identify the main sources of overdiagnosis and misdiagnosis in COPD.
Collapse
|
117
|
Veroniki AA, Bender R, Glasziou P, Straus SE, Tricco AC. The number needed to treat in pairwise and network meta-analysis and its graphical representation. J Clin Epidemiol 2019; 111:11-22. [PMID: 30905696 DOI: 10.1016/j.jclinepi.2019.03.007] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Revised: 11/30/2018] [Accepted: 03/06/2019] [Indexed: 12/22/2022]
Abstract
OBJECTIVE The objective of this study was to present ways to graphically represent a number needed to treat (NNT) in (network) meta-analysis (NMA). STUDY DESIGN AND SETTING A barrier to using NNT in NMA when an odds ratio (OR) or risk ratio (RR) is used is the determination of a single control event rate (CER). We discuss approaches to calculate a CER, and illustrate six graphical methods for NNT from NMA. We illustrate the graphical approaches using an NMA of cognitive enhancers for Alzheimer's dementia. RESULTS The NNT calculation using a relative effect measure, such as OR and RR, requires a CER value, but different CERs, including mean CER across studies, pooled CER in meta-analysis, and expert opinion-based CER may result in different NNTs. An NNT from NMA can be presented in a bar plot, Cates plot, or forest plot for a single outcome, and a bubble plot, scatterplot, or rank-heat plot for ≥2 outcomes. Each plot is associated with different properties and can serve different needs. CONCLUSION Caution is needed in NNT interpretation, as considerations such as selection of effect size and CER, and CER assumption across multiple comparisons, may impact NNT and decision-making. The proposed graphs are helpful to interpret NNTs calculated from (network) meta-analyses.
Collapse
|
118
|
Pathirana T, Hayen A, Doust J, Glasziou P, Bell K. Lifetime risk of prostate cancer overdiagnosis in Australia: quantifying the risk of overdiagnosis associated with prostate cancer screening in Australia using a novel lifetime risk approach. BMJ Open 2019; 9:e022457. [PMID: 30858156 PMCID: PMC6429722 DOI: 10.1136/bmjopen-2018-022457] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
OBJECTIVES To quantify the risk of overdiagnosis associated with prostate cancer screening in Australia using a novel lifetime risk approach. DESIGN Modelling and validation of the lifetime risk method using publicly available population data. SETTING Opportunistic screening for prostate cancer in the Australian population. PARTICIPANTS Australian male population (1982-2012). INTERVENTIONS Prostate-specific antigen testing for prostate cancer screening. PRIMARY AND SECONDARY OUTCOME MEASURES Primary: lifetime risk of overdiagnosis in 2012 (excess lifetime cancer risk adjusted for changing competing mortality); Secondary: lifetime risk of prostate cancer diagnosis (unadjusted and adjusted for competing mortality); Excess lifetime risk of prostate cancer diagnosis (for all years subsequent to 1982). RESULTS The lifetime risk of being diagnosed with prostate cancer increased from 6.1% in 1982 (1 in 17) to 19.6% in 2012 (1 in 5). Using 2012 competing mortality rates, the lifetime risk in 1982 was 11.5% (95% CI 11.0% to 12.0%). The excess lifetime risk of prostate cancer in 2012 (adjusted for changing competing mortality) was 8.2% (95% CI 7.6% to 8.7%) (1 in 13). This corresponds to 41% of prostate cancers being overdiagnosed. CONCLUSIONS Our estimated rate of overdiagnosis is in agreement with estimates using other methods. This method may be used without the need to adjust for lead times. If annual (cross-sectional) data are used, then it may give valid estimates of overdiagnosis once screening has been established long enough for the benefits from the early detection of non-overdiagnosed cancer at a younger age to be realised in older age groups.
Collapse
|
119
|
Glasziou P, Chalmers I. Authors' reply to Sharp and Curlewis, Ahmed and colleagues, and Sreeharan. BMJ 2019; 364:l808. [PMID: 30850382 DOI: 10.1136/bmj.l808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
120
|
Albarqouni L, Hoffmann T, McLean K, Price K, Glasziou P. Role of professional networks on social media in addressing clinical questions at general practice: a cross-sectional study of general practitioners in Australia and New Zealand. BMC FAMILY PRACTICE 2019; 20:43. [PMID: 30841866 PMCID: PMC6404315 DOI: 10.1186/s12875-019-0931-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 02/27/2019] [Indexed: 11/10/2022]
Abstract
Background Clinicians frequently have questions about patient care. However, for more than half of the generated questions, answers are never pursued, and if they are, often not answered satisfactorily. We aimed to characterise the clinical questions asked and answers provided by general practitioners (GP) through posts to a popular professional social media network. Methods In this cross-sectional study, we analysed clinical questions and answers posted between January 20th and February 10th 2018 on a popular GP-restricted (Australia, New Zealand) Facebook group. Each clinical question was categorised according to ‘background’ or ‘foreground’ question; type (e.g. treatment, diagnosis); and the clinical topic (e.g. cardiovascular). Each answer provided in response to included questions was categorised into: (i) short answer (e.g. agree/disagree); (ii) provided an explanation to justify the answer; and (iii) referred to a published relevant evidence resource. Results Of 1060 new posts during the study period, 204 (19%) included a clinical question. GPs most commonly asked about treatment (n = 87; 43%) and diagnosis (n = 59; 29%). Five major topics (23% skin, 10% psychology, 9% cardiovascular, 8% female genital, and 7% musculoskeletal) accounted for 118 (58%) questions. Each question received on average 10 (SD = 9) answers: 42% were short; 51% provided an explanation; and only 6% referred to relevant research evidence. Only 3 answers referred to systematic reviews. Conclusions In this sample of Australian and New Zealand GPs, who were members of a GP social media group, GPs asked clinical questions that can be organised into a limited number of question types and topics. This might help guide the development of GP learning programs.
Collapse
|
121
|
O'Connor AM, Tsafnat G, Gilbert SB, Thayer KA, Shemilt I, Thomas J, Glasziou P, Wolfe MS. Still moving toward automation of the systematic review process: a summary of discussions at the third meeting of the International Collaboration for Automation of Systematic Reviews (ICASR). Syst Rev 2019; 8:57. [PMID: 30786933 PMCID: PMC6381675 DOI: 10.1186/s13643-019-0975-y] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 02/12/2019] [Indexed: 12/28/2022] Open
Abstract
The third meeting of the International Collaboration for Automation of Systematic Reviews (ICASR) was held 17-18 October 2017 in London, England. ICASR is an interdisciplinary group whose goal is to maximize the use of technology for conducting rapid, accurate, and efficient systematic reviews of scientific evidence. The group seeks to facilitate the development and widespread acceptance of automated techniques for systematic reviews. The meeting's conclusion was that the most pressing needs at present are to develop approaches for validating currently available tools and to provide increased access to curated corpora that can be used for validation. To that end, ICASR's short-term goals in 2018-2019 are to propose and publish protocols for key tasks in systematic reviews and to develop an approach for sharing curated corpora for validating the automation of the key tasks.
Collapse
|
122
|
Bell KJL, Doust J, Glasziou P, Cullen L, Harris IA, Smith L, Buchbinder R, Barratt A. Recognizing the Potential for Overdiagnosis: Are High-Sensitivity Cardiac Troponin Assays an Example? Ann Intern Med 2019; 170:259-261. [PMID: 30716768 DOI: 10.7326/m18-2645] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
123
|
Schünemann HJ, Mustafa RA, Brozek J, Santesso N, Bossuyt PM, Steingart KR, Leeflang M, Lange S, Trenti T, Langendam M, Scholten R, Hooft L, Murad MH, Jaeschke R, Rutjes A, Singh J, Helfand M, Glasziou P, Arevalo-Rodriguez I, Akl EA, Deeks JJ, Guyatt GH. GRADE guidelines: 22. The GRADE approach for tests and strategies-from test accuracy to patient-important outcomes and recommendations. J Clin Epidemiol 2019; 111:69-82. [PMID: 30738926 DOI: 10.1016/j.jclinepi.2019.02.003] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Revised: 11/14/2018] [Accepted: 02/04/2019] [Indexed: 11/28/2022]
Abstract
OBJECTIVES This article describes the Grading of Recommendations Assessment, Development and Evaluation (GRADE) Working Group's framework of moving from test accuracy to patient or population-important outcomes. We focus on the common scenario when studies directly evaluating the effect of diagnostic and other tests or strategies on health outcomes are not available or are not providing the best available evidence. STUDY DESIGN AND SETTING Using practical examples, we explored how guideline developers and other decision makers can use information from test accuracy to develop a recommendation by linking evidence that addresses downstream consequences. Guideline panels should develop an analytic framework that summarizes the actions that follow from applying a test and the consequences. RESULTS We describe GRADE's current thinking about the overall certainty of the evidence (also known as quality of the evidence or confidence in the estimates) arising from consideration of the often complex pathways that involve multiple tests and management options. Each link in the evidence can-and often does-lower the overall certainty of the evidence required to formulate recommendations and make decisions about tests. The frequency with which an outcome occurs and its importance will influence whether or not a particular step in the linked evidence is critical to decision-making. CONCLUSIONS Overall certainty may be expressed by the weakest critical step in the linked evidence. The linked approach to addressing optimal testing will often require the use of decision analytic approaches. We present an example that involves decision modeling in a GRADE Evidence to Decision framework for cervical cancer screening. However, because resources and time of guideline developers may be limited, we describe alternative, pragmatic strategies for developing recommendations addressing test use.
Collapse
|
124
|
Chalmers I, Atkinson P, Badenoch D, Glasziou P, Austvoll-Dahlgren A, Oxman A, Clarke M. The James Lind Initiative: books, websites and databases to promote critical thinking about treatment claims, 2003 to 2018. RESEARCH INVOLVEMENT AND ENGAGEMENT 2019; 5:6. [PMID: 30766728 PMCID: PMC6360692 DOI: 10.1186/s40900-019-0138-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Accepted: 01/15/2019] [Indexed: 06/09/2023]
Abstract
BACKGROUND The James Lind Initiative (JLI) was a work programme inaugurated by Iain Chalmers and Patricia Atkinson to press for better research for better health care. It ran between 2003 and 2018, when Iain Chalmers retired. During the 15 years of its existence, the JLI developed three strands of work in collaboration with the authors of this paper, and with others. WORK THEMES The first work strand involved developing a process for use by patients, carers and clinicians to identify shared priorities for research - the James Lind Alliance. The second strand was a series of articles, meetings, prizes and other developments to raise awareness of the massive amounts of avoidable waste in research, and of ways of reducing it. The third strand involved using a variety of approaches to promote better public and professional understanding of the importance of research in clinical practice and public health. JLI work on the first two themes has been addressed in previously published reports. This paper summarises JLI involvement during the 15 years of its existence in giving talks, convening workshops, writing books, and creating websites and databases to promote critical thinking about treatment claims. CONCLUSION During its 15-year life, the James Lind Initiative worked collaboratively with others to create free teaching and learning resources to help children and adults learn how to recognise untrustworthy claims about the effects of treatments. These resources have been translated in more than twenty languages, but much more could be done to support their uptake and wider use.
Collapse
|
125
|
Thomas R, Sims R, Beller E, Scott AM, Doust J, Le Couteur D, Pond D, Loy C, Forlini C, Glasziou P. An Australian community jury to consider case-finding for dementia: Differences between informed community preferences and general practice guidelines. Health Expect 2019; 22:475-484. [PMID: 30714290 PMCID: PMC6543153 DOI: 10.1111/hex.12871] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Revised: 01/06/2019] [Accepted: 01/08/2019] [Indexed: 01/13/2023] Open
Abstract
Background Case‐finding for dementia is practised by general practitioners (GPs) in Australia but without an awareness of community preferences. We explored the values and preferences of informed community members around case‐finding for dementia in Australian general practice. Design, setting and participants A before and after, mixed‐methods study in Gold Coast, Australia, with ten community members aged 50‐70. Intervention A 2‐day citizen/community jury. Participants were informed by experts about dementia, the potential harms and benefits of case‐finding, and ethical considerations. Primary and secondary outcomes We asked participants, “Should the health system encourage GPs to practice ‘case‐finding’ of dementia in people older than 50?” Case‐finding was defined as a GP initiating testing for dementia when the patient is unaware of symptoms. We also assessed changes in participant comprehension/knowledge, attitudes towards dementia and participants’ own intentions to undergo case‐finding for dementia if it were suggested. Results Participants voted unanimously against case‐finding for dementia, citing a lack of effective treatments, potential for harm to patients and potential financial incentives. However, they recognized that case‐finding was currently practised by Australian GPs and recommended specific changes to the guidelines. Participants increased their comprehension/knowledge of dementia, their attitude towards case‐finding became less positive, and their intentions to be tested themselves decreased. Conclusion Once informed, community jury participants did not agree case‐finding for dementia should be conducted by GPs. Yet their personal intentions to accept case‐finding varied. If case‐finding for dementia is recommended in the guidelines, then shared decision making is essential.
Collapse
|