1
|
Sarafoglou A, Hoogeveen S, van den Bergh D, Aczel B, Albers CJ, Althoff T, Botvinik-Nezer R, Busch NA, Cataldo AM, Devezer B, van Dongen NNN, Dreber A, Fried EI, Hoekstra R, Hoffman S, Holzmeister F, Huber J, Huntington-Klein N, Ioannidis J, Johannesson M, Kirchler M, Loken E, Mangin JF, Matzke D, Menkveld AJ, Nilsonne G, van Ravenzwaaij D, Schweinsberg M, Schulz-Kuempel H, Shanks DR, Simons DJ, Spellman BA, Stoevenbelt AH, Szaszi B, Trübutschek D, Tuerlinckx F, Uhlmann EL, Vanpaemel W, Wicherts J, Wagenmakers EJ. Subjective evidence evaluation survey for many-analysts studies. ROYAL SOCIETY OPEN SCIENCE 2024; 11:240125. [PMID: 39050728 PMCID: PMC11265885 DOI: 10.1098/rsos.240125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Accepted: 04/22/2024] [Indexed: 07/27/2024]
Abstract
Many-analysts studies explore how well an empirical claim withstands plausible alternative analyses of the same dataset by multiple, independent analysis teams. Conclusions from these studies typically rely on a single outcome metric (e.g. effect size) provided by each analysis team. Although informative about the range of plausible effects in a dataset, a single effect size from each team does not provide a complete, nuanced understanding of how analysis choices are related to the outcome. We used the Delphi consensus technique with input from 37 experts to develop an 18-item subjective evidence evaluation survey (SEES) to evaluate how each analysis team views the methodological appropriateness of the research design and the strength of evidence for the hypothesis. We illustrate the usefulness of the SEES in providing richer evidence assessment with pilot data from a previous many-analysts study.
Collapse
Affiliation(s)
| | | | - Don van den Bergh
- Department of Psychology, University of Amsterdam, Amsterdam, The Netherlands
| | - Balazs Aczel
- Institute of Psychology, ELTE Eötvös Lorénd University, Budapest, Hungary
| | - Casper J. Albers
- Heymans Institute for Psychological Research, University of Groningen, Groningen, The Netherlands
| | - Tim Althoff
- Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - Rotem Botvinik-Nezer
- Hebrew University of Jerusalem, Jerusalem, Israel
- Dartmouth College, Hanover, NH, USA
| | - Niko A. Busch
- Institute for Psychology, University of Münster, Münster, Germany
| | - Andrea M. Cataldo
- Center for Depression, Anxiety and Stress Research, McLean Hospital, Belmont, MA, USA
- Department of Psychiatry, Harvard Medical School, Boston, MA, USA
| | - Berna Devezer
- Department of Business, University of Idaho, Moscow, ID, USA
| | | | - Anna Dreber
- Stockholm School of Economics, Stockholm, Sweden
- University of Innsbruck, Innsbruck, Tirol, Austria
| | - Eiko I. Fried
- Department of Psychology, Leiden University, Leiden, The Netherlands
| | - Rink Hoekstra
- Nieuwenhuis Institute for Educational Research, University of Groningen, Groningen, The Netherlands
| | - Sabine Hoffman
- Department of Statistics, Ludwig-Maximilians-Universität München, Munchen, Bayern, Germany
| | | | - Jürgen Huber
- University of Innsbruck, Innsbruck, Tirol, Austria
| | | | - John Ioannidis
- Meta-Research Innovation Center at Stanford (METRICS) and Departments of Medicine, of Epidemiology and of Population Health, of Biomedical Data Science, and of Statistics, Stanford University, Stanford, CA, USA
| | | | | | - Eric Loken
- University of Conneticut, Storrs, CT, USA
| | - Jan-Francois Mangin
- University Paris-Saclay, Gif-sur-Yvette, France
- Neurospin CEA, Gif-sur-Yvette, Île-de-France, France
| | - Dora Matzke
- Department of Psychology, University of Amsterdam, Amsterdam, The Netherlands
| | | | | | - Don van Ravenzwaaij
- Heymans Institute for Psychological Research, University of Groningen, Groningen, The Netherlands
| | | | - Hannah Schulz-Kuempel
- Department of Statistics and The Institute for Medical Information Processing, Biometry, and Epidemiology, LMU Munich, Munchen, Bayern, Germany
- The Institute for Medical Information Processing, Biometry, and Epidemiology, LMU Munich, Munchen, Bayern, Germany
| | - David R. Shanks
- Division of Psychology and Language Sciences, University College London, 26 Bedford Way, London WC1H 0AP, UK
| | | | - Barbara A. Spellman
- School of Law, University of Virginia, 580 Massie Road, Charlottesville, VA, USA
| | - Andrea H. Stoevenbelt
- Nieuwenhuis Institute for Educational Research, University of Groningen, Groningen, The Netherlands
| | - Barnabas Szaszi
- Institute of Psychology, ELTE Eötvös Lorénd University, Budapest, Hungary
| | | | | | | | | | - Jelte Wicherts
- Department of Methodology and Statistics, Tilburg University, Tilburg, The Netherlands
| | | |
Collapse
|
2
|
Girardi P, Vesely A, Lakens D, Altoè G, Pastore M, Calcagnì A, Finos L. Post-selection Inference in Multiverse Analysis (PIMA): An Inferential Framework Based on the Sign Flipping Score Test. PSYCHOMETRIKA 2024; 89:542-568. [PMID: 38664342 DOI: 10.1007/s11336-024-09973-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Indexed: 06/11/2024]
Abstract
When analyzing data, researchers make some choices that are either arbitrary, based on subjective beliefs about the data-generating process, or for which equally justifiable alternative choices could have been made. This wide range of data-analytic choices can be abused and has been one of the underlying causes of the replication crisis in several fields. Recently, the introduction of multiverse analysis provides researchers with a method to evaluate the stability of the results across reasonable choices that could be made when analyzing data. Multiverse analysis is confined to a descriptive role, lacking a proper and comprehensive inferential procedure. Recently, specification curve analysis adds an inferential procedure to multiverse analysis, but this approach is limited to simple cases related to the linear model, and only allows researchers to infer whether at least one specification rejects the null hypothesis, but not which specifications should be selected. In this paper, we present a Post-selection Inference approach to Multiverse Analysis (PIMA) which is a flexible and general inferential approach that considers for all possible models, i.e., the multiverse of reasonable analyses. The approach allows for a wide range of data specifications (i.e., preprocessing) and any generalized linear model; it allows testing the null hypothesis that a given predictor is not associated with the outcome, by combining information from all reasonable models of multiverse analysis, and provides strong control of the family-wise error rate allowing researchers to claim that the null hypothesis can be rejected for any specification that shows a significant effect. The inferential proposal is based on a conditional resampling procedure. We formally prove that the Type I error rate is controlled, and compute the statistical power of the test through a simulation study. Finally, we apply the PIMA procedure to the analysis of a real dataset on the self-reported hesitancy for the COronaVIrus Disease 2019 (COVID-19) vaccine before and after the 2020 lockdown in Italy. We conclude with practical recommendations to be considered when implementing the proposed procedure.
Collapse
Affiliation(s)
- Paolo Girardi
- Department of Environmental Sciences, Informatics and Statistics, Ca' Foscari University of Venice, Via Torino 155, 30172, Venezia-Mestre, VE, Italy.
| | - Anna Vesely
- Department of Statistical Sciences, University of Bologna, Bologna, Italy
| | - Daniël Lakens
- Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, Eindhoven, The Netherlands
| | - Gianmarco Altoè
- Department of Developmental Psychology and Socialisation, University of Padova, Padua, Italy
| | - Massimiliano Pastore
- Department of Developmental Psychology and Socialisation, University of Padova, Padua, Italy
| | - Antonio Calcagnì
- Department of Developmental Psychology and Socialisation, University of Padova, Padua, Italy
- GNCS Research Group, GNCS-INdAM RESEARCH GROUP, Rome, Italy
| | - Livio Finos
- Department of Statistical Sciences, University of Padova, Padua, Italy
| |
Collapse
|
3
|
Wang W, Liu M, He Q, Wang M, Xu J, Li L, Li G, He L, Zou K, Sun X. Validation and impact of algorithms for identifying variables in observational studies of routinely collected data. J Clin Epidemiol 2024; 166:111232. [PMID: 38043830 DOI: 10.1016/j.jclinepi.2023.111232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 11/23/2023] [Accepted: 11/28/2023] [Indexed: 12/05/2023]
Abstract
BACKGROUND AND OBJECTIVES Among observational studies of routinely collected health data (RCD) for exploring treatment effects, algorithms are used to identify study variables. However, the extent to which algorithms are reliable and impact the credibility of effect estimates is far from clear. This study aimed to investigate the validation of algorithms for identifying study variables from RCD, and examine the impact of alternative algorithms on treatment effects. METHODS We searched PubMed for observational studies published in 2018 that used RCD to explore drug treatment effects. Information regarding the reporting, validation, and interpretation of algorithms was extracted. We summarized the reporting and methodological characteristics of algorithms and validation. We also assessed the divergence in effect estimates given alternative algorithms by calculating the ratio of estimates of the primary vs. alternative analyses. RESULTS A total of 222 studies were included, of which 93 (41.9%) provided a complete list of algorithms for identifying participants, 36 (16.2%) for exposure, and 132 (59.5%) for outcomes, and 15 (6.8%) for all study variables including population, exposure, and outcomes. Fifty-nine (26.6%) studies stated that the algorithms were validated, and 54 (24.3%) studies reported methodological characteristics of 66 validations, among which 61 validations in 49 studies were from the cross-referenced validation studies. Of those 66 validations, 22 (33.3%) reported sensitivity and 16 (24.2%) reported specificity. A total of 63.6% of studies reporting sensitivity and 56.3% reporting specificity used test-result-based sampling, an approach that potentially biases effect estimates. Twenty-eight (12.6%) studies used alternative algorithms to identify study variables, and 24 reported the effects estimated by primary analyses and sensitivity analyses. Of these, 20% had differential effect estimates when using alternative algorithms for identifying population, 18.2% for identifying exposure, and 45.5% for classifying outcomes. Only 32 (14.4%) studies discussed how the algorithms may affect treatment estimates. CONCLUSION In observational studies of RCD, the algorithms for variable identification were not regularly validated, and-even if validated-the methodological approach and performance of the validation were often poor. More seriously, different algorithms may yield differential treatment effects, but their impact is often ignored by researchers. Strong efforts, including recommendations, are warranted to improve good practice.
Collapse
Affiliation(s)
- Wen Wang
- Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu 610041, China; NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu 610041, China; Sichuan Center of Technology Innovation for Real World Data, Chengdu 610041, China.
| | - Mei Liu
- Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu 610041, China; NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu 610041, China; Sichuan Center of Technology Innovation for Real World Data, Chengdu 610041, China
| | - Qiao He
- Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu 610041, China; NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu 610041, China; Sichuan Center of Technology Innovation for Real World Data, Chengdu 610041, China
| | - Mingqi Wang
- Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu 610041, China; NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu 610041, China; Sichuan Center of Technology Innovation for Real World Data, Chengdu 610041, China
| | - Jiayue Xu
- Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu 610041, China; NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu 610041, China; Sichuan Center of Technology Innovation for Real World Data, Chengdu 610041, China
| | - Ling Li
- Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu 610041, China; NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu 610041, China; Sichuan Center of Technology Innovation for Real World Data, Chengdu 610041, China
| | - Guowei Li
- Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, Ontario L8S 4L8, Canada; Center for Clinical Epidemiology and Methodology, Guangdong Second Provincial General Hospital, Guangzhou, Guangdong 510317, China; Biostatistics Unit, Research Institute at St. Joseph's Healthcare Hamilton, Hamilton, Ontario L8N 4A6, Canada
| | - Lin He
- Intelligence Library Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Kang Zou
- Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu 610041, China; NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu 610041, China; Sichuan Center of Technology Innovation for Real World Data, Chengdu 610041, China
| | - Xin Sun
- Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu 610041, China; NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu 610041, China; Sichuan Center of Technology Innovation for Real World Data, Chengdu 610041, China.
| |
Collapse
|
4
|
Chapelle C, Le Teuff G, Zufferey PJ, Laporte S, Ollier E. A framework to characterise the reproducibility of meta-analysis results with its application to direct oral anticoagulants in the acute treatment of venous thromboembolism. Res Synth Methods 2024; 15:117-129. [PMID: 37846195 DOI: 10.1002/jrsm.1676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 09/13/2023] [Accepted: 09/23/2023] [Indexed: 10/18/2023]
Abstract
The number of meta-analyses of aggregate data has dramatically increased due to the facility of obtaining data from publications and the development of free, easy-to-use, and specialised statistical software. Even when meta-analyses include the same studies, their results may vary owing to different methodological choices. Assessment of the replication of meta-analysis provides an example of the variation of effect 'naturally' observed between multiple research projects. Reproducibility of results has mostly been reported using graphical descriptive representations. A quantitative analysis of such results would enable (i) breakdown of the total observed variability with quantification of the variability generated by the replication process and (ii) identification of which variables account for this variability, such as methodological quality or the statistical analysis procedures used. These variables might explain systematic mean differences between results and dispersion of the results. To quantitatively characterise the reproducibility of meta-analysis results, a bivariate linear mixed-effects model was developed to simulate both mean results and their corresponding uncertainty. Results were assigned to several replication groups, those assessing the same studies, outcomes, treatment indication and comparisons classified in the same replication group. A nested random effect structure was used to break down the total variability within each replication group and between these groups to enable calculation of an intragroup correlation coefficient and quantification of reproducibility. Determinants of variability were investigated by modelling both mean and variance parameters using covariates. The proposed model was applied to the example of meta-analyses evaluating direct oral anticoagulants in the acute treatment of venous thromboembolism.
Collapse
Affiliation(s)
- Céline Chapelle
- Université Jean-Monnet, Mines Saint-Étienne, INSERM, U1059, SAINBIOSE, F-42023; Service de pharmacologie clinique, CHU Saint-Étienne, F-42055 Saint-Étienne, France, Université Jean Monnet, Saint-Étienne, France
| | - Gwénaël Le Teuff
- Service de Biostatistique et d'Épidémiologie, Gustave Roussy, Université Paris-Saclay, Villejuif, France; Oncostat U1018, Inserm, Équipe Labellisée Ligue Contre le Cancer, Université Paris-Saclay, Villejuif, France
| | - Paul Jacques Zufferey
- Département d'Anesthésie-Réanimation, Service de pharmacologie clinique, CHU Saint-Étienne, F-42055 Saint-Étienne; Université Jean-Monnet, Mines Saint- Étienne, INSERM, U1059, SAINBIOSE, F-42023, CHU Saint-Étienne, Saint-Étienne, France
| | - Silvy Laporte
- Université Jean-Monnet, Mines Saint-Étienne, INSERM, U1059, SAINBIOSE, F-42023; Service de pharmacologie clinique, CHU Saint-Étienne, F-42055 Saint-Étienne, France, Université Jean Monnet, Saint-Étienne, France
| | - Edouard Ollier
- Université Jean-Monnet, Mines Saint-Étienne, INSERM, U1059, SAINBIOSE, F-42023; Service de pharmacologie clinique, CHU Saint-Étienne, F-42055 Saint-Étienne, France, Université Jean Monnet, Saint-Étienne, France
| |
Collapse
|
5
|
Brockhaus EK, Wolffram D, Stadler T, Osthege M, Mitra T, Littek JM, Krymova E, Klesen AJ, Huisman JS, Heyder S, Helleckes LM, an der Heiden M, Funk S, Abbott S, Bracher J. Why are different estimates of the effective reproductive number so different? A case study on COVID-19 in Germany. PLoS Comput Biol 2023; 19:e1011653. [PMID: 38011276 PMCID: PMC10703420 DOI: 10.1371/journal.pcbi.1011653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 12/07/2023] [Accepted: 11/03/2023] [Indexed: 11/29/2023] Open
Abstract
The effective reproductive number Rt has taken a central role in the scientific, political, and public discussion during the COVID-19 pandemic, with numerous real-time estimates of this quantity routinely published. Disagreement between estimates can be substantial and may lead to confusion among decision-makers and the general public. In this work, we compare different estimates of the national-level effective reproductive number of COVID-19 in Germany in 2020 and 2021. We consider the agreement between estimates from the same method but published at different time points (within-method agreement) as well as retrospective agreement across eight different approaches (between-method agreement). Concerning the former, estimates from some methods are very stable over time and hardly subject to revisions, while others display considerable fluctuations. To evaluate between-method agreement, we reproduce the estimates generated by different groups using a variety of statistical approaches, standardizing analytical choices to assess how they contribute to the observed disagreement. These analytical choices include the data source, data pre-processing, assumed generation time distribution, statistical tuning parameters, and various delay distributions. We find that in practice, these auxiliary choices in the estimation of Rt may affect results at least as strongly as the selection of the statistical approach. They should thus be communicated transparently along with the estimates.
Collapse
Affiliation(s)
- Elisabeth K. Brockhaus
- Chair of Statistical Methods and Econometrics, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
| | - Daniel Wolffram
- Chair of Statistical Methods and Econometrics, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
- Computational Statistics Group, Heidelberg Institute for Theoretical Studies (HITS), Heidelberg, Germany
| | - Tanja Stadler
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Michael Osthege
- Institute of Bio- and Geosciences, IBG-1: Biotechnology, Forschungszentrum Jülich GmbH, Jülich, Germany
- Institute of Biotechnology, RWTH Aachen University, Aachen, Germany
| | - Tanmay Mitra
- Department of Systems Immunology and Braunschweig Integrated Centre of Systems Biology (BRICS), Helmholtz Centre for Infection Research, Braunschweig, Germany
- Current address: Kennedy Institute of Rheumatology, Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, United Kingdom
| | - Jonas M. Littek
- Chair of Statistical Methods and Econometrics, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
| | - Ekaterina Krymova
- Swiss Data Science Center, EPF Lausanne and ETH Zurich, Zurich, Switzerland
| | - Anna J. Klesen
- Chair of Statistical Methods and Econometrics, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
| | - Jana S. Huisman
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
- Physics of Living Systems, Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Stefan Heyder
- Institute of Mathematics, Technische Universität Ilmenau, Ilmenau, Germany
| | - Laura M. Helleckes
- Institute of Bio- and Geosciences, IBG-1: Biotechnology, Forschungszentrum Jülich GmbH, Jülich, Germany
- Institute of Biotechnology, RWTH Aachen University, Aachen, Germany
| | | | - Sebastian Funk
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Sam Abbott
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Johannes Bracher
- Chair of Statistical Methods and Econometrics, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
- Computational Statistics Group, Heidelberg Institute for Theoretical Studies (HITS), Heidelberg, Germany
| |
Collapse
|
6
|
Levitt M, Zonta F, Ioannidis JPA. Excess death estimates from multiverse analysis in 2009-2021. Eur J Epidemiol 2023; 38:1129-1139. [PMID: 37043153 PMCID: PMC10090741 DOI: 10.1007/s10654-023-00998-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 03/27/2023] [Indexed: 04/13/2023]
Abstract
Excess death estimates have great value in public health, but they can be sensitive to analytical choices. Here we propose a multiverse analysis approach that considers all possible different time periods for defining the reference baseline and a range of 1 to 4 years for the projected time period for which excess deaths are calculated. We used data from the Human Mortality Database on 33 countries with detailed age-stratified death information on an annual basis during the period 2009-2021. The use of different time periods for reference baseline led to large variability in the absolute magnitude of the exact excess death estimates. However, the relative ranking of different countries compared to others for specific years remained largely unaltered. The relative ranking of different years for the specific country was also largely independent of baseline. Averaging across all possible analyses, distinct time patterns were discerned across different countries. Countries had declines between 2009 and 2019, but the steepness of the decline varied markedly. There were also large differences across countries on whether the COVID-19 pandemic years 2020-2021 resulted in an increase of excess deaths and by how much. Consideration of longer projected time windows resulted in substantial shrinking of the excess deaths in many, but not all countries. Multiverse analysis of excess deaths over long periods of interest can offer an approach that better accounts for the uncertainty in estimating expected mortality patterns, comparative mortality trends across different countries, and the nature of observed mortality peaks.
Collapse
Affiliation(s)
- Michael Levitt
- Department of Structural Biology, Stanford University, Stanford, CA, 94305, USA
| | - Francesco Zonta
- Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech University, Shanghai, 201210, China
| | - John P A Ioannidis
- Departments of Medicine, of Epidemiology and Population Health, of Biomedical Data Science, and of Statistics, and Meta-Research Innovation Center at Stanford (METRICS), Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
7
|
Glisic M, Raguindin PF, Gemperli A, Taneri PE, Salvador DJ, Voortman T, Marques Vidal P, Papatheodorou SI, Kunutsor SK, Bano A, Ioannidis JPA, Muka T. A 7-Step Guideline for Qualitative Synthesis and Meta-Analysis of Observational Studies in Health Sciences. Public Health Rev 2023; 44:1605454. [PMID: 37260612 PMCID: PMC10227668 DOI: 10.3389/phrs.2023.1605454] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 04/14/2023] [Indexed: 06/02/2023] Open
Abstract
Objectives: To provide a step-by-step, easy-to-understand, practical guide for systematic review and meta-analysis of observational studies. Methods: A multidisciplinary team of researchers with extensive experience in observational studies and systematic review and meta-analysis was established. Previous guidelines in evidence synthesis were considered. Results: There is inherent variability in observational study design, population, and analysis, making evidence synthesis challenging. We provided a framework and discussed basic meta-analysis concepts to assist reviewers in making informed decisions. We also explained several statistical tools for dealing with heterogeneity, probing for bias, and interpreting findings. Finally, we briefly discussed issues and caveats for translating results into clinical and public health recommendations. Our guideline complements "A 24-step guide on how to design, conduct, and successfully publish a systematic review and meta-analysis in medical research" and addresses peculiarities for observational studies previously unexplored. Conclusion: We provided 7 steps to synthesize evidence from observational studies. We encourage medical and public health practitioners who answer important questions to systematically integrate evidence from observational studies and contribute evidence-based decision-making in health sciences.
Collapse
Affiliation(s)
- Marija Glisic
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Swiss Paraplegic Research, Nottwil, Switzerland
| | - Peter Francis Raguindin
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Swiss Paraplegic Research, Nottwil, Switzerland
- Graduate School for Health Sciences, University of Bern, Bern, Switzerland
- Faculty of Health Science and Medicine, University of Lucerne, Lucerne, Switzerland
| | - Armin Gemperli
- Swiss Paraplegic Research, Nottwil, Switzerland
- Institute of Primary and Community Care, University of Lucerne, Lucerne, Switzerland
| | - Petek Eylul Taneri
- HRB-Trials Methodology Research Network, National University of Ireland, Galway, Ireland
| | - Dante Jr. Salvador
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Graduate School for Health Sciences, University of Bern, Bern, Switzerland
| | - Trudy Voortman
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam, Netherlands
- Division of Human Nutrition and Health, Wageningen University and Research, Wageningen, Netherlands
| | - Pedro Marques Vidal
- Department of Medicine, Internal Medicine, Lausanne University Hospital (CHUV) and University of Lausanne, Lausanne, Switzerland
| | | | - Setor K. Kunutsor
- Diabetes Research Centre, University of Leicester, Leicester General Hospital, Leicester, United Kingdom
- Translational Health Sciences, Bristol Medical School, University of Bristol, Southmead Hospital, Bristol, United Kingdom
| | - Arjola Bano
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Department of Cardiology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| | - John P. A. Ioannidis
- Stanford Prevention Research Center, Department of Medicine, Stanford University School of Medicine, Stanford, CA, United States
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, United States
- Department of Statistics, Stanford University, Stanford, CA, United States
- Meta-Research Innovation Center at Stanford (METRICS), Stanford University, Stanford, CA, United States
| | - Taulant Muka
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Meta-Research Innovation Center at Stanford (METRICS), Stanford University, Stanford, CA, United States
- Epistudia, Bern, Switzerland
| |
Collapse
|
8
|
Visontay R, Mewton L, Sunderland M, Bell S, Britton A, Osman B, North H, Mathew N, Slade T. A comprehensive evaluation of the longitudinal association between alcohol consumption and a measure of inflammation: Multiverse and vibration of effects analyses. Drug Alcohol Depend 2023; 247:109886. [PMID: 37120919 DOI: 10.1016/j.drugalcdep.2023.109886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 02/16/2023] [Accepted: 04/16/2023] [Indexed: 05/02/2023]
Abstract
BACKGROUND Moderate alcohol consumption appears to be associated with reduced inflammation. Determining whether this association is robust to common variations in research parameters has wide-reaching implications for our understanding of disease aetiology and public health policy. We aimed to conduct comprehensive multiverse and vibration of effects analyses evaluating the associations between alcohol consumption and a measure of inflammation. METHODS A secondary analysis of the 1970 British Birth Cohort Study was performed, using data from 1970 through 2016. Measurements of alcohol consumption were taken in early/mid-adulthood (ages 34 and 42), and level of inflammation marker high-sensitivity C-reactive protein (hsCRP) at age 46. Multiverse analyses were applied to comparisons of low-to-moderate consumption and consumption above various international drinking guidelines with an 'abstinent' reference. Research parameters of interest related to: definitions of drinking and reference groups; alcohol consumption measurement year; outcome variable transformation; and breadth of covariate adjustment. After identifying various analytic options within these parameters and running the analysis over each unique option combination, specification curve plots, volcano plots, effect ranges, and variance decomposition metrics were used to assess consistency of results. RESULTS A total of 3101 individuals were included in the final analyses, with primary analyses limited to those where occasional consumers served as reference. All combinations of research specifications resulted in lower levels of inflammation amongst low-to-moderate consumers compared to occasional consumers (1st percentile effect: -0.21; 99th percentile effect: -0.04). Estimates comparing above-guidelines drinking with occasional consumers were less definitive (1st percentile effect: -0.26; 99th percentile effect: 0.43). CONCLUSIONS The association between low-to-moderate drinking and lower hsCRP levels is largely robust to common variations in researcher-defined parameters, warranting further research to establish whether this relationship is causal. The association between above-guidelines drinking and hsCRP levels is less definitive.
Collapse
Affiliation(s)
- Rachel Visontay
- The Matilda Centre for Research in Mental Health and Substance Use, Level 6, Jane Foss Russell Building, G02, The University of Sydney, NSW2006, Australia.
| | - Louise Mewton
- The Matilda Centre for Research in Mental Health and Substance Use, Level 6, Jane Foss Russell Building, G02, The University of Sydney, NSW2006, Australia; Centre for Healthy Brain Ageing, Level 1, AGSM (G27), University of New South Wales, Gate 11, Botany Street, Sydney, NSW2052, Australia
| | - Matthew Sunderland
- The Matilda Centre for Research in Mental Health and Substance Use, Level 6, Jane Foss Russell Building, G02, The University of Sydney, NSW2006, Australia
| | - Steven Bell
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, UK; British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge, UK; Research Department of Epidemiology and Public Health, University College London, London, UK
| | - Annie Britton
- Research Department of Epidemiology and Public Health, University College London, London, UK
| | - Bridie Osman
- The Matilda Centre for Research in Mental Health and Substance Use, Level 6, Jane Foss Russell Building, G02, The University of Sydney, NSW2006, Australia
| | - Hayley North
- Neuroscience Research Australia, Randwick, NSW2031, Australia
| | - Nisha Mathew
- Neuroscience Research Australia, Randwick, NSW2031, Australia; School of Clinical Medicine, UNSW Medicine and Health, University of New South Wales, NSW 2052, Australia
| | - Tim Slade
- The Matilda Centre for Research in Mental Health and Substance Use, Level 6, Jane Foss Russell Building, G02, The University of Sydney, NSW2006, Australia
| |
Collapse
|
9
|
Tangirala S, Tierney BT, Patel CJ. Prioritization of COVID-19 risk factors in July 2020 and February 2021 in the UK. COMMUNICATIONS MEDICINE 2023; 3:45. [PMID: 36997659 PMCID: PMC10062272 DOI: 10.1038/s43856-023-00271-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Accepted: 03/07/2023] [Indexed: 04/04/2023] Open
Abstract
BACKGROUND Risk for COVID-19 positivity and hospitalization due to diverse environmental and sociodemographic factors may change as the pandemic progresses. METHODS We investigated the association of 360 exposures sampled before COVID-19 outcomes for participants in the UK Biobank, including 9268 and 38,837 non-overlapping participants, sampled at July 17, 2020 and February 2, 2021, respectively. The 360 exposures included clinical biomarkers (e.g., BMI), health indicators (e.g., doctor-diagnosed diabetes), and environmental/behavioral variables (e.g., air pollution) measured 10-14 years before the COVID-19 time periods. RESULTS Here we show, for example, "participant having son and/or daughter in household" was associated with an increase in incidence from 20% to 32% (risk difference of 12%) between timepoints. Furthermore, we find age to be increasingly associated with COVID-19 positivity over time from Risk Ratio [RR] (per 10-year age increase) of 0.81 to 0.6 (hospitalization RR from 1.18 to 2.63, respectively). CONCLUSIONS Our data-driven approach demonstrates that time of pandemic plays a role in identifying risk factors associated with positivity and hospitalization.
Collapse
Affiliation(s)
- Sivateja Tangirala
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Statistics and Data Science, Cornell University, Ithaca, NY, USA
| | - Braden T Tierney
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Chirag J Patel
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
10
|
Levitt M, Zonta F, Ioannidis J. Excess death estimates from multiverse analysis in 2009-2021. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2022.09.21.22280219. [PMID: 36172123 PMCID: PMC9516863 DOI: 10.1101/2022.09.21.22280219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Excess death estimates have great value in public health, but they can be sensitive to analytical choices. Here we propose a multiverse analysis approach that considers all possible different time periods for defining the reference baseline and a range of 1 to 4 years for the projected time period for which excess deaths are calculated. We used data from the Human Mortality Database on 33 countries with detailed age-stratified death information on an annual basis during the period 2009-2021. The use of different time periods for reference baseline led to large variability in the absolute magnitude of the exact excess death estimates. However, the relative ranking of different countries compared to others for specific years remained largely unaltered. The relative ranking of different years for the specific country was also largely independent of baseline. Averaging across all possible analyses, distinct time patterns were discerned across different countries. Countries had declines between 2009 and 2019, but the steepness of the decline varied markedly. There were also large differences across countries on whether the COVID-19 pandemic years 2020-2021 resulted in an increase of excess deaths and by how much. Consideration of longer projected time windows resulted in substantial shrinking of the excess deaths in many, but not all countries. Multiverse analysis of excess deaths over long periods of interest can offer a more unbiased approach to understand comparative mortality trends across different countries, the range of uncertainty around estimates, and the nature of observed mortality peaks.
Collapse
|
11
|
Hardwicke TE, Wagenmakers EJ. Reducing bias, increasing transparency and calibrating confidence with preregistration. Nat Hum Behav 2023; 7:15-26. [PMID: 36707644 DOI: 10.1038/s41562-022-01497-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 11/09/2022] [Indexed: 01/29/2023]
Abstract
Flexibility in the design, analysis and interpretation of scientific studies creates a multiplicity of possible research outcomes. Scientists are granted considerable latitude to selectively use and report the hypotheses, variables and analyses that create the most positive, coherent and attractive story while suppressing those that are negative or inconvenient. This creates a risk of bias that can lead to scientists fooling themselves and fooling others. Preregistration involves declaring a research plan (for example, hypotheses, design and statistical analyses) in a public registry before the research outcomes are known. Preregistration (1) reduces the risk of bias by encouraging outcome-independent decision-making and (2) increases transparency, enabling others to assess the risk of bias and calibrate their confidence in research outcomes. In this Perspective, we briefly review the historical evolution of preregistration in medicine, psychology and other domains, clarify its pragmatic functions, discuss relevant meta-research, and provide recommendations for scientists and journal editors.
Collapse
Affiliation(s)
- Tom E Hardwicke
- Department of Psychology, University of Amsterdam, Amsterdam, the Netherlands.
| | | |
Collapse
|
12
|
Gouraud H, Wallach JD, Boussageon R, Ross JS, Naudet F. Vibration of effect in more than 16 000 pooled analyses of individual participant data from 12 randomised controlled trials comparing canagliflozin and placebo for type 2 diabetes mellitus: multiverse analysis. BMJ MEDICINE 2022; 1:e000154. [PMID: 36936564 PMCID: PMC9978683 DOI: 10.1136/bmjmed-2022-000154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 08/23/2022] [Indexed: 11/04/2022]
Abstract
Objective To evaluate the impact of conducting all possible pooled analyses across different combinations of randomised controlled trials and endpoints. Design Multiverse analysis, consisting of numerous pooled analyses of individual participant data. Setting Individual patient data from 12 randomised controlled trials comparing canagliflozin treatment with placebo, shared on the Yale University Open Data Access project (https://yoda.yale.edu/) platform, up to 16 April 2021. Participants 15 094 people with type 2 diabetes mellitus. Main outcome measures Pooled analyses estimated changes in serum glycated haemoglobin (HbA1c), major adverse cardiovascular events, and serious adverse events at weeks 12, 18, 26, and 52. The distribution of effect estimates was calculated for all possible combinations, and the direction and magnitude of the first and 99th centiles of effect estimates were compared. Results Across 16 332 distinct pooled analyses comparing canagliflozin with placebo for changes in HbA1c, standardised effect estimates were in favour of canagliflozin treatment at both the first centile (-0.75%) and 99th centile (-0.48%); 15 994 (97.93%) analyses showed significant results (P<0.05) in favour of canagliflozin. For major adverse cardiovascular events, estimated hazard ratios were 0.20 at the first centile and 0.90 at the 99th centile; 2705 of 8144 analyses (33.21%) were significant, all of which were in favour of canagliflozin treatment. For serious adverse events, estimated hazard ratios were 0.59 at the first centile and 1.14 at the 99th centile; 5793 of 16 332 (35.47%) analyses were significant, with 5754 in favour of canagliflozin and 39 in favour of placebo. Conclusion Results from pooled analyses can be subject to vibration of effects and should be critically appraised, especially regarding the risk for selection and availability bias in individual participant data retrieved.
Collapse
Affiliation(s)
- Henri Gouraud
- Inserm, CIC 1414 (Centre d’Investigation Clinique de Rennes), Rennes 1 University, Rennes, France
- Inserm, Irset (Institut de recherche en santé, environnement et travail), Rennes 1 University, Rennes, France
| | - Joshua D Wallach
- Department of Environmental Health Sciences, Yale University School of Public Health, New Haven, CT, USA
| | - Rémy Boussageon
- UCBL, CNRS, UMR 5558, LBBE, EMET, University Claude Bernard Lyon 1, Villeurbanne, France
| | - Joseph S Ross
- Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| | - Florian Naudet
- Inserm, CIC 1414 (Centre d’Investigation Clinique de Rennes), Rennes 1 University, Rennes, France
- Inserm, Irset (Institut de recherche en santé, environnement et travail), Rennes 1 University, Rennes, France
- Institut Universitaire de France, Paris, France
| |
Collapse
|
13
|
Ioannidis JP. Pre-registration of mathematical models. Math Biosci 2022; 345:108782. [DOI: 10.1016/j.mbs.2022.108782] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Revised: 01/13/2022] [Accepted: 01/13/2022] [Indexed: 11/28/2022]
|
14
|
Gene-level metagenomic architectures across diseases yield high-resolution microbiome diagnostic indicators. Nat Commun 2021; 12:2907. [PMID: 34006865 PMCID: PMC8131609 DOI: 10.1038/s41467-021-23029-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 04/13/2021] [Indexed: 02/06/2023] Open
Abstract
We propose microbiome disease “architectures”: linking >1 million microbial features (species, pathways, and genes) to 7 host phenotypes from 13 cohorts using a pipeline designed to identify associations that are robust to analytical model choice. Here, we quantify conservation and heterogeneity in microbiome-disease associations, using gene-level analysis to identify strain-specific, cross-disease, positive and negative associations. We find coronary artery disease, inflammatory bowel diseases, and liver cirrhosis to share gene-level signatures ascribed to the Streptococcus genus. Type 2 diabetes, by comparison, has a distinct metagenomic signature not linked to any one specific species or genus. We additionally find that at the species-level, the prior-reported connection between Solobacterium moorei and colorectal cancer is not consistently identified across models—however, our gene-level analysis unveils a group of robust, strain-specific gene associations. Finally, we validate our findings regarding colorectal cancer and inflammatory bowel diseases in independent cohorts and identify that features inversely associated with disease tend to be less reproducible than features enriched in disease. Overall, our work is not only a step towards gene-based, cross-disease microbiome diagnostic indicators, but it also illuminates the nuances of the genetic architecture of the human microbiome, including tension between gene- and species-level associations. Here, combing the massive gene-universe of the gut microbiome to identify strain-specific, cross-disease, associations across seven human diseases, the authors introduce the concept of microbiome architecture, defined as the complete set of positive and negative associations between microbial genes and human host disease, highlighting microbiome architectures as potential diagnostic indicators.
Collapse
|