1
|
Zehetmayer S, Koenig F, Posch M. A general consonance principle for closure tests based on p-values. Stat Methods Med Res 2024; 33:1595-1609. [PMID: 39440585 DOI: 10.1177/09622802241269624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2024]
Abstract
The closure principle is a powerful approach to constructing efficient testing procedures controlling the familywise error rate in the strong sense. For small numbers of hypotheses and the setting of independent elementary p -values we consider closed tests where each intersection hypothesis is tested with a p -value combination test. Examples of such combination tests are the Fisher combination test, the Stouffer test, the Omnibus test, the truncated test, or the Wilson test. Some of these tests, such as the Fisher combination, the Stouffer, or the Omnibus test, are not consonant and rejection of the global null hypothesis does not always lead to rejection of at least one elementary null hypothesis. We develop a general principle to uniformly improve closed tests based on p -value combination tests by modifying the rejection regions such that the new procedure becomes consonant. For the Fisher combination test and the Stouffer test, we show by simulations that this improvement can lead to a substantial increase in power.
Collapse
Affiliation(s)
- Sonja Zehetmayer
- Center for Medical Data Science, Medical University of Vienna, Austria
| | - Franz Koenig
- Center for Medical Data Science, Medical University of Vienna, Austria
| | - Martin Posch
- Center for Medical Data Science, Medical University of Vienna, Austria
| |
Collapse
|
2
|
Ren J, Telschow FJE, Schwartzman A. Inverse set estimation and inversion of simultaneous confidence intervals. J R Stat Soc Ser C Appl Stat 2024; 73:1082-1109. [PMID: 39145308 PMCID: PMC11321826 DOI: 10.1093/jrsssc/qlae027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 01/11/2024] [Accepted: 04/19/2024] [Indexed: 08/16/2024]
Abstract
Motivated by the questions of risk assessment in climatology (temperature change in North America) and medicine (impact of statin usage and coronavirus disease 2019 on hospitalized patients), we address the problem of estimating the set in the domain of a function whose image equals a predefined subset of the real line. Existing methods require strict assumptions. We generalize the estimation of such sets to dense and nondense domains with protection against inflated Type I error in exploratory data analysis. This is achieved by proving that confidence sets of multiple upper, lower, or interval sets can be simultaneously constructed with the desired confidence nonasymptotically through inverting simultaneous confidence intervals. Nonparametric bootstrap algorithm and code are provided.
Collapse
Affiliation(s)
- Junting Ren
- Division of Biostatistics, University of California San Diego, La Jolla, CA, USA
| | | | - Armin Schwartzman
- Division of Biostatistics, University of California San Diego, La Jolla, CA, USA
- Halıcıoğlu Data Science Institute, University of California San Diego, La Jolla, CA, USA
| |
Collapse
|
3
|
Girardi P, Vesely A, Lakens D, Altoè G, Pastore M, Calcagnì A, Finos L. Post-selection Inference in Multiverse Analysis (PIMA): An Inferential Framework Based on the Sign Flipping Score Test. PSYCHOMETRIKA 2024; 89:542-568. [PMID: 38664342 DOI: 10.1007/s11336-024-09973-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Indexed: 06/11/2024]
Abstract
When analyzing data, researchers make some choices that are either arbitrary, based on subjective beliefs about the data-generating process, or for which equally justifiable alternative choices could have been made. This wide range of data-analytic choices can be abused and has been one of the underlying causes of the replication crisis in several fields. Recently, the introduction of multiverse analysis provides researchers with a method to evaluate the stability of the results across reasonable choices that could be made when analyzing data. Multiverse analysis is confined to a descriptive role, lacking a proper and comprehensive inferential procedure. Recently, specification curve analysis adds an inferential procedure to multiverse analysis, but this approach is limited to simple cases related to the linear model, and only allows researchers to infer whether at least one specification rejects the null hypothesis, but not which specifications should be selected. In this paper, we present a Post-selection Inference approach to Multiverse Analysis (PIMA) which is a flexible and general inferential approach that considers for all possible models, i.e., the multiverse of reasonable analyses. The approach allows for a wide range of data specifications (i.e., preprocessing) and any generalized linear model; it allows testing the null hypothesis that a given predictor is not associated with the outcome, by combining information from all reasonable models of multiverse analysis, and provides strong control of the family-wise error rate allowing researchers to claim that the null hypothesis can be rejected for any specification that shows a significant effect. The inferential proposal is based on a conditional resampling procedure. We formally prove that the Type I error rate is controlled, and compute the statistical power of the test through a simulation study. Finally, we apply the PIMA procedure to the analysis of a real dataset on the self-reported hesitancy for the COronaVIrus Disease 2019 (COVID-19) vaccine before and after the 2020 lockdown in Italy. We conclude with practical recommendations to be considered when implementing the proposed procedure.
Collapse
Affiliation(s)
- Paolo Girardi
- Department of Environmental Sciences, Informatics and Statistics, Ca' Foscari University of Venice, Via Torino 155, 30172, Venezia-Mestre, VE, Italy.
| | - Anna Vesely
- Department of Statistical Sciences, University of Bologna, Bologna, Italy
| | - Daniël Lakens
- Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, Eindhoven, The Netherlands
| | - Gianmarco Altoè
- Department of Developmental Psychology and Socialisation, University of Padova, Padua, Italy
| | - Massimiliano Pastore
- Department of Developmental Psychology and Socialisation, University of Padova, Padua, Italy
| | - Antonio Calcagnì
- Department of Developmental Psychology and Socialisation, University of Padova, Padua, Italy
- GNCS Research Group, GNCS-INdAM RESEARCH GROUP, Rome, Italy
| | - Livio Finos
- Department of Statistical Sciences, University of Padova, Padua, Italy
| |
Collapse
|
4
|
Fischer L, Roig MB, Brannath W. An exhaustive ADDIS principle for online FWER control. Biom J 2024; 66:e2300237. [PMID: 38637319 DOI: 10.1002/bimj.202300237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 01/27/2024] [Accepted: 03/09/2024] [Indexed: 04/20/2024]
Abstract
In this paper, we consider online multiple testing with familywise error rate (FWER) control, where the probability of committing at least one type I error will remain under control while testing a possibly infinite sequence of hypotheses over time. Currently, adaptive-discard (ADDIS) procedures seem to be the most promising online procedures with FWER control in terms of power. Now, our main contribution is a uniform improvement of the ADDIS principle and thus of all ADDIS procedures. This means, the methods we propose reject as least as much hypotheses as ADDIS procedures and in some cases even more, while maintaining FWER control. In addition, we show that there is no other FWER controlling procedure that enlarges the event of rejecting any hypothesis. Finally, we apply the new principle to derive uniform improvements of the ADDIS-Spending and ADDIS-Graph.
Collapse
Affiliation(s)
- Lasse Fischer
- Competence Center for Clinical Trials Bremen, University of Bremen, Bremen, Germany
| | - Marta Bofill Roig
- Center for Medical Data Science, Medical University of Vienna, Vienna, Austria
| | - Werner Brannath
- Competence Center for Clinical Trials Bremen, University of Bremen, Bremen, Germany
| |
Collapse
|
5
|
Luo D, Ebadi A, Emery K, He Y, Noble WS, Keich U. Competition-based control of the false discovery proportion. Biometrics 2023; 79:3472-3484. [PMID: 36652258 DOI: 10.1111/biom.13830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 10/12/2022] [Accepted: 01/02/2023] [Indexed: 01/19/2023]
Abstract
Recently, Barber and Candès laid the theoretical foundation for a general framework for false discovery rate (FDR) control based on the notion of "knockoffs." A closely related FDR control methodology has long been employed in the analysis of mass spectrometry data, referred to there as "target-decoy competition" (TDC). However, any approach that aims to control the FDR, which is defined as the expected value of the false discovery proportion (FDP), suffers from a problem. Specifically, even when successfully controlling the FDR at level α, the FDP in the list of discoveries can significantly exceed α. We offer FDP-SD, a new procedure that rigorously controls the FDP in the knockoff/TDC competition setup by guaranteeing that the FDP is bounded by α at a desired confidence level. Compared with the recently published framework of Katsevich and Ramdas, FDP-SD generally delivers more power and often substantially so in simulated and real data.
Collapse
Affiliation(s)
- Dong Luo
- School of Mathematics and Statistics, University of Sydney, New South Wales, Australia
| | - Arya Ebadi
- School of Mathematics and Statistics, University of Sydney, New South Wales, Australia
| | - Kristen Emery
- School of Mathematics and Statistics, University of Sydney, New South Wales, Australia
| | - Yilun He
- School of Mathematics and Statistics, University of Sydney, New South Wales, Australia
| | | | - Uri Keich
- School of Mathematics and Statistics, University of Sydney, New South Wales, Australia
| |
Collapse
|
6
|
Heller R, Krieger A, Rosset S. Optimal multiple testing and design in clinical trials. Biometrics 2023; 79:1908-1919. [PMID: 35899317 DOI: 10.1111/biom.13726] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 07/22/2022] [Indexed: 02/01/2023]
Abstract
A central goal in designing clinical trials is to find the test that maximizes power (or equivalently minimizes required sample size) for finding a false null hypothesis subject to the constraint of type I error. When there is more than one test, such as in clinical trials with multiple endpoints, the issues of optimal design and optimal procedures become more complex. In this paper, we address the question of how such optimal tests should be defined and how they can be found. We review different notions of power and how they relate to study goals, and also consider the requirements of type I error control and the nature of the procedures. This leads us to an explicit optimization problem with objective and constraints that describe its specific desiderata. We present a complete solution for deriving optimal procedures for two hypotheses, which have desired monotonicity properties, and are computationally simple. For some of the optimization formulations this yields optimal procedures that are identical to existing procedures, such as Hommel's procedure or the procedure of Bittman et al. (2009), while for other cases it yields completely novel and more powerful procedures than existing ones. We demonstrate the nature of our novel procedures and their improved power extensively in a simulation and on the APEX study (Cohen et al., 2016).
Collapse
Affiliation(s)
- Ruth Heller
- Department of Statistics and Operations Research, Tel-Aviv University, Tel Aviv, Israel
| | - Abba Krieger
- Department of Statistics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Saharon Rosset
- Department of Statistics and Operations Research, Tel-Aviv University, Tel Aviv, Israel
| |
Collapse
|
7
|
Leday GGR, Hemerik J, Engel J, van der Voet H. Improved family-wise error rate control in multiple equivalence testing. Food Chem Toxicol 2023; 178:113928. [PMID: 37406754 DOI: 10.1016/j.fct.2023.113928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 06/20/2023] [Accepted: 06/30/2023] [Indexed: 07/07/2023]
Abstract
Equivalence testing is an important component of safety assessments, used for example by the European Food Safety Authority, to allow new food or feed products on the market. The aim of such tests is to demonstrate equivalence of characteristics of test and reference crops. Equivalence tests are typically univariate and applied to each measured analyte (characteristic) separately without multiplicity correction. This increases the probability of making false claims of equivalence (type I errors) when evaluating multiple analytes simultaneously. To solve this problem, familywise error rate (FWER) control using Hochberg's method has been proposed. This paper demonstrates that, in the context of equivalence testing, other FWER-controlling methods are more powerful than Hochberg's. Particularly, it is shown that Hommel's method is guaranteed to perform at least as well as Hochberg's and that an "adaptive" version of Bonferroni's method, which uses an estimator of the proportion of non-equivalent characteristics, often substantially outperforms Hommel's method. Adaptive Bonferroni takes better advantage of the particular context of food safety where a large proportion of true equivalences is expected, a situation where other methods are particularly conservative. The different methods are illustrated by their application to two compositional datasets and further assessed and compared using simulated data.
Collapse
Affiliation(s)
- Gwenaël G R Leday
- Wageningen University and Research, Biometris, Droevendaalsesteeg 1, 6708, PB, Wageningen, the Netherlands.
| | - Jesse Hemerik
- Wageningen University and Research, Biometris, Droevendaalsesteeg 1, 6708, PB, Wageningen, the Netherlands
| | - Jasper Engel
- Wageningen University and Research, Biometris, Droevendaalsesteeg 1, 6708, PB, Wageningen, the Netherlands
| | - Hilko van der Voet
- Wageningen University and Research, Biometris, Droevendaalsesteeg 1, 6708, PB, Wageningen, the Netherlands
| |
Collapse
|
8
|
Andreella A, Hemerik J, Finos L, Weeda W, Goeman J. Permutation-based true discovery proportions for functional magnetic resonance imaging cluster analysis. Stat Med 2023; 42:2311-2340. [PMID: 37259808 DOI: 10.1002/sim.9725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 11/24/2022] [Accepted: 03/18/2023] [Indexed: 06/02/2023]
Abstract
We propose a permutation-based method for testing a large collection of hypotheses simultaneously. Our method provides lower bounds for the number of true discoveries in any selected subset of hypotheses. These bounds are simultaneously valid with high confidence. The methodology is particularly useful in functional Magnetic Resonance Imaging cluster analysis, where it provides a confidence statement on the percentage of truly activated voxels within clusters of voxels, avoiding the well-known spatial specificity paradox. We offer a user-friendly tool to estimate the percentage of true discoveries for each cluster while controlling the family-wise error rate for multiple testing and taking into account that the cluster was chosen in a data-driven way. The method adapts to the spatial correlation structure that characterizes functional Magnetic Resonance Imaging data, gaining power over parametric approaches.
Collapse
Affiliation(s)
- Angela Andreella
- Department of Economics, Ca' Foscari University of Venice, Venice, Italy
| | - Jesse Hemerik
- Biometris, Wageningen University and Research, Wageningen, The Netherlands
| | - Livio Finos
- Department of Statistics, University of Padova, Padova, Italy
| | - Wouter Weeda
- Department of Psychology, Leiden University, Leiden, The Netherlands
| | - Jelle Goeman
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
9
|
Bogomolov M. Testing partial conjunction hypotheses under dependency, with applications to meta-analysis. Electron J Stat 2023. [DOI: 10.1214/22-ejs2100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Marina Bogomolov
- Faculty of Data and Decision Sciences, Technion - Israel Institute of Technology, Haifa 3200003, Israel
| |
Collapse
|
10
|
Vovk V, Wang R. Confidence and Discoveries with E-values. Stat Sci 2023. [DOI: 10.1214/22-sts874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
Affiliation(s)
- Vladimir Vovk
- Vladimir Vovk is Professor, Department of Computer Science, Royal Holloway, University of London, Egham, Surrey, UK
| | - Ruodu Wang
- Ruodu Wang is Professor and University Research Chair, Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada
| |
Collapse
|
11
|
Xu N, Solari A, Goeman JJ. Closed testing with Globaltest, with application in metabolomics. Biometrics 2022. [PMID: 35567306 DOI: 10.1111/biom.13693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2021] [Accepted: 05/02/2022] [Indexed: 11/30/2022]
Abstract
The Globaltest is a powerful test for the global null hypothesis that there is no association between a group of features and a response of interest, which is popular in pathway testing in metabolomics. Evaluating multiple feature sets, however, requires multiple testing correction. In this paper, we propose a multiple testing method, based on closed testing, specifically designed for the Globaltest. The proposed method controls the family-wise error rate simultaneously over all possible feature sets, and therefore allows post hoc inference, i.e. the researcher may choose feature sets of interest after seeing the data without jeopardizing error control. To circumvent the exponential computation time of closed testing, we derive a novel shortcut that allows exact closed testing to be performed on the scale of metabolomics data. An R package ctgt is available on CRAN for the implementation of the shortcut procedure, with applications on several real metabolomics data examples. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Ningning Xu
- Department of Biomedical Data Sciences, Leiden University Medical Center, The Netherlands
| | - Aldo Solari
- Department of Economics, Management and Statistics, University of Milano-Bicocca, Italy
| | - Jelle J Goeman
- Department of Biomedical Data Sciences, Leiden University Medical Center, The Netherlands
| |
Collapse
|
12
|
Vovk V, Wang B, Wang R. Admissible ways of merging p-values under arbitrary dependence. Ann Stat 2022. [DOI: 10.1214/21-aos2109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Vladimir Vovk
- Department of Computer Science, Royal Holloway, University of London
| | - Bin Wang
- RCSDS, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences
| | - Ruodu Wang
- Department of Statistics and Actuarial Science, University of Waterloo
| |
Collapse
|
13
|
Goeman JJ, Solari A. Comparing Three Groups. AM STAT 2021. [DOI: 10.1080/00031305.2021.2002188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Jelle J. Goeman
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| | - Aldo Solari
- Department of Economics, Management and Statistics, University of Milano-Bicocca, Milan, Italy
| |
Collapse
|