1
|
Kelter R. The evidence interval and the Bayesian evidence value: On a unified theory for Bayesian hypothesis testing and interval estimation. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2022; 75:550-592. [PMID: 36200811 DOI: 10.1111/bmsp.12267] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Revised: 01/06/2022] [Indexed: 06/16/2023]
Abstract
Interval estimation is one of the most frequently used methods in statistical science, employed to provide a range of credible values a parameter is located in after taking into account the uncertainty in the data. However, while this interpretation only holds for Bayesian interval estimates, these suffer from two problems. First, Bayesian interval estimates can include values which have not been corroborated by observing the data. Second, Bayesian interval estimates and hypothesis tests can yield contradictory conclusions. In this paper a new theory for Bayesian hypothesis testing and interval estimation is presented. A new interval estimate is proposed, the Bayesian evidence interval, which is inspired by the Pereira-Stern theory of the full Bayesian significance test (FBST). It is shown that the evidence interval is a generalization of existing Bayesian interval estimates, that it solves the problems of standard Bayesian interval estimates and that it unifies Bayesian hypothesis testing and parameter estimation. The Bayesian evidence value is introduced, which quantifies the evidence for the (interval) null and alternative hypothesis. Based on the evidence interval and the evidence value, the (full) Bayesian evidence test (FBET) is proposed as a new, model-independent Bayesian hypothesis test. Additionally, a decision rule for hypothesis testing is derived which shows the relationship to a widely used decision rule based on the region of practical equivalence and Bayesian highest posterior density intervals and to the e-value in the FBST. In summary, the proposed method is universally applicable, computationally efficient, and while the evidence interval can be seen as an extension of existing Bayesian interval estimates, the FBET is a generalization of the FBST and contains it as a special case. Together, the theory developed provides a unification of Bayesian hypothesis testing and interval estimation and is made available in the R package fbst.
Collapse
Affiliation(s)
- Riko Kelter
- Department of Mathematics, University of Siegen, Siegen, Germany
| |
Collapse
|
2
|
Belzile LR, Davison AC. Improved inference on risk measures for univariate extremes. Ann Appl Stat 2022. [DOI: 10.1214/21-aoas1555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
3
|
Cella L, Martin R. Direct and approximately valid probabilistic inference on a class of statistical functionals. Int J Approx Reason 2022. [DOI: 10.1016/j.ijar.2022.09.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
4
|
Reid N. D. A. S. Fraser: From structural inference to asymptotics. CAN J STAT 2022. [DOI: 10.1002/cjs.11720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Nancy Reid
- Department of Statistical Sciences University of Toronto Toronto Ontario Canada
| |
Collapse
|
5
|
Mayo DG, Hand D. Statistical significance and its critics: practicing damaging science, or damaging scientific practice? SYNTHESE 2022; 200:220. [PMID: 35578622 PMCID: PMC9096069 DOI: 10.1007/s11229-022-03692-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2020] [Accepted: 04/05/2022] [Indexed: 05/27/2023]
Abstract
While the common procedure of statistical significance testing and its accompanying concept of p-values have long been surrounded by controversy, renewed concern has been triggered by the replication crisis in science. Many blame statistical significance tests themselves, and some regard them as sufficiently damaging to scientific practice as to warrant being abandoned. We take a contrary position, arguing that the central criticisms arise from misunderstanding and misusing the statistical tools, and that in fact the purported remedies themselves risk damaging science. We argue that banning the use of p-value thresholds in interpreting data does not diminish but rather exacerbates data-dredging and biasing selection effects. If an account cannot specify outcomes that will not be allowed to count as evidence for a claim-if all thresholds are abandoned-then there is no test of that claim. The contributions of this paper are: To explain the rival statistical philosophies underlying the ongoing controversy; To elucidate and reinterpret statistical significance tests, and explain how this reinterpretation ameliorates common misuses and misinterpretations; To argue why recent recommendations to replace, abandon, or retire statistical significance undermine a central function of statistics in science: to test whether observed patterns in the data are genuine or due to background variability.
Collapse
|
6
|
Bhattacharya I, Martin R. Gibbs posterior inference on multivariate quantiles. J Stat Plan Inference 2022. [DOI: 10.1016/j.jspi.2021.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
7
|
Bickel DR. Interval estimation, point estimation, and null hypothesis significance testing calibrated by an estimated posterior probability of the null hypothesis. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2021.1921805] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- David R. Bickel
- The Graduate School, Informatics and Analytics, University of North Carolina at Greensboro, Greensboro, NC, USA
| |
Collapse
|
8
|
Veronese P, Melilli E. Confidence Distribution for the Ability Parameter of the Rasch Model. PSYCHOMETRIKA 2021; 86:131-166. [PMID: 33534091 DOI: 10.1007/s11336-021-09747-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 09/10/2020] [Accepted: 01/08/2021] [Indexed: 06/12/2023]
Abstract
In this paper, we consider the Rasch model and suggest novel point estimators and confidence intervals for the ability parameter. They are based on a proposed confidence distribution (CD) whose construction has required to overcome some difficulties essentially due to the discrete nature of the model. When the number of items is large, the computations due to the combinatorics involved become heavy, and thus, we provide first- and second-order approximations of the CD. Simulation studies show the good behavior of our estimators and intervals when compared with those obtained through other standard frequentist and weakly informative Bayesian procedures. Finally, using the expansion of the expected length of the suggested interval, we are able to identify reasonable values of the sample size which lead to a desired length of the interval.
Collapse
Affiliation(s)
- Piero Veronese
- Department of Decision Sciences, Bocconi University, via Roentgen 1, 20136, Milano, Italy.
| | - Eugenio Melilli
- Department of Decision Sciences, Bocconi University, via Roentgen 1, 20136, Milano, Italy
| |
Collapse
|
9
|
The sufficiency of the evidence, the relevancy of the evidence, and quantifying both with a single number. STAT METHOD APPL-GER 2021. [DOI: 10.1007/s10260-020-00553-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
10
|
Model-free posterior inference on the area under the receiver operating characteristic curve. J Stat Plan Inference 2020. [DOI: 10.1016/j.jspi.2020.03.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
11
|
Peck J, Goossens B, Saeys Y. Detecting adversarial manipulation using inductive Venn-ABERS predictors. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.11.113] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
12
|
Bickel DR. Confidence distributions and empirical Bayes posterior distributions unified as distributions of evidential support. COMMUN STAT-THEOR M 2020. [DOI: 10.1080/03610926.2020.1790004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- David R. Bickel
- Department of Biochemistry, Microbiology, and Immunology, Department of Mathematics and Statistics, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Canada
| |
Collapse
|
13
|
Wang G, Sarkar A, Carbonetto P, Stephens M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J R Stat Soc Series B Stat Methodol 2020; 82:1273-1300. [DOI: 10.1111/rssb.12388] [Citation(s) in RCA: 176] [Impact Index Per Article: 44.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
14
|
|
15
|
Bickel DR, Patriota AG. Self-consistent confidence sets and tests of composite hypotheses applicable to restricted parameters. BERNOULLI 2019. [DOI: 10.3150/17-bej942] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
16
|
Fraser DAS, Reid N, Lin W. When should modes of inference disagree? Some simple but challenging examples. Ann Appl Stat 2018. [DOI: 10.1214/18-aoas1160sf] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
17
|
|
18
|
|
19
|
|
20
|
|
21
|
Fraser DAS, Bédard M, Wong A, Lin W, Fraser AM. Bayes, Reproducibility and the Quest for Truth. Stat Sci 2016. [DOI: 10.1214/16-sts573] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
22
|
Hannig J, Iyer H, Lai RCS, Lee TCM. Generalized Fiducial Inference: A Review and New Results. J Am Stat Assoc 2016. [DOI: 10.1080/01621459.2016.1165102] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Affiliation(s)
- Jan Hannig
- Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, NC, USA
| | - Hari Iyer
- Statistical Engineering Division, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Randy C. S. Lai
- Department of Statistics, University of California, Davis, Davis, CA, USA
- Department of Mathematics & Statistics, University of Maine, Orono, ME, USA
| | - Thomas C. M. Lee
- Department of Statistics, University of California, Davis, Davis, CA, USA
| |
Collapse
|
23
|
Martin R, Liu C. Marginal Inferential Models: Prior-Free Probabilistic Inference on Interest Parameters. J Am Stat Assoc 2016. [DOI: 10.1080/01621459.2014.985827] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
24
|
Affiliation(s)
- Youngjo Lee
- Department of Statistics; Seoul National University; Seoul Korea
| | - Gwangsu Kim
- Department of Statistics; Korea University; Seoul Korea
| |
Collapse
|
25
|
Moreno E, Girón J, Casella G. Posterior Model Consistency in Variable Selection as the Model Dimension Grows. Stat Sci 2015. [DOI: 10.1214/14-sts508] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
26
|
Bahamyirou A, Marchand É. On the discrepancy between Bayes credibility and frequentist probability of coverage. Stat Probab Lett 2015. [DOI: 10.1016/j.spl.2014.10.020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
27
|
Affiliation(s)
- Nancy Reid
- Department of Statistics; University of Toronto; Toronto Canada
| | | |
Collapse
|
28
|
Liu C, Martin R. Frameworks for prior-free posterior probabilistic inference. ACTA ACUST UNITED AC 2014. [DOI: 10.1002/wics.1329] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Chuanhai Liu
- Department of Statistics; Purdue University; West Lafayette IN USA
| | - Ryan Martin
- Department of Mathematics, Statistics, and Computer Science; University of Illinois at Chicago; Chicago IL USA
| |
Collapse
|
29
|
Bickel DR. Small-scale Inference: Empirical Bayes and Confidence Methods for as Few as a Single Comparison. Int Stat Rev 2014. [DOI: 10.1111/insr.12064] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- David R. Bickel
- Ottawa Institute of Systems Biology, Department of Biochemistry, Microbiology, and Immunology, Department of Mathematics and Statistics; University of Ottawa; 451 Smyth Road Ottawa, Ontario K1H 8M5 Canada
| |
Collapse
|
30
|
|
31
|
Martin R, Liu C. Conditional inferential models: combining information for prior-free probabilistic inference. J R Stat Soc Series B Stat Methodol 2014. [DOI: 10.1111/rssb.12070] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
32
|
|
33
|
Hannig J. Discussion of “On the Birnbaum Argument for the Strong Likelihood Principle”. Stat Sci 2014. [DOI: 10.1214/14-sts474] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
34
|
Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. The deviance information criterion: 12 years on. J R Stat Soc Series B Stat Methodol 2014. [DOI: 10.1111/rssb.12062] [Citation(s) in RCA: 337] [Impact Index Per Article: 33.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
35
|
Fraser DAS. Discussion. Int Stat Rev 2013. [DOI: 10.1111/insr.12006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
36
|
Martin R, Liu C. Inferential Models: A Framework for Prior-Free Posterior Probabilistic Inference. J Am Stat Assoc 2013. [DOI: 10.1080/01621459.2012.747960] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
37
|
Xie MG, Singh K. Confidence Distribution, the Frequentist Distribution Estimator of a Parameter: A Review. Int Stat Rev 2013. [DOI: 10.1111/insr.12000] [Citation(s) in RCA: 180] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
38
|
|
39
|
Gelman A, Robert CP. “Not Only Defended But Also Applied”: The Perceived Absurdity of Bayesian Inference. AM STAT 2013. [DOI: 10.1080/00031305.2013.760987] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
40
|
Marchand É, Strawderman WE. On Bayesian credible sets, restricted parameter spaces and frequentist coverage. Electron J Stat 2013. [DOI: 10.1214/13-ejs806] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
41
|
Bickel DR. Controlling the degree of caution in statistical inference with the Bayesian and frequentist approaches as opposite extremes. Electron J Stat 2012. [DOI: 10.1214/12-ejs689] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
42
|
|