Safe testing

Peter Grünwald*, Rianne de Heide, Wouter Koolen

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

We develop the theory of hypothesis testing based on the e-value, a notion of evidence that, unlike the p-value, allows for effortlessly combining results from several studies in the common scenario where the decision to perform a new study may depend on previous outcomes. Tests based on e-values are safe, i.e. they preserve type-I error guarantees, under such optional continuation. We define growth rate optimality (GRO) as an analogue of power in an optional continuation context, and we show how to construct GRO e-variables for general testing problems with composite null and alternative, emphasizing models with nuisance parameters. GRO e-values take the form of Bayes factors with special priors. We illustrate the theory using several classic examples including a 1-sample safe t-test and the 2 × 2 contingency table. Sharing Fisherian, Neymanian, and Jeffreys–Bayesian interpretations, e-values may provide a methodology acceptable to adherents of all three schools.

Original languageEnglish
Pages (from-to)1091-1128
Number of pages38
JournalJournal of the Royal Statistical Society. Series B: Statistical Methodology
Volume86
Issue number5
Early online date7 Mar 2024
DOIs
Publication statusPublished - Nov 2024

Bibliographical note

PDF is enlarged with discussion paper contributions (81 pages).

Publisher Copyright:
© The Royal Statistical Society 2024. All rights reserved.

Keywords

  • Bayes factors
  • e-values
  • hypothesis testing
  • information projection
  • optional stopping
  • test martingales

Fingerprint

Dive into the research topics of 'Safe testing'. Together they form a unique fingerprint.
  • Safe Testing

    Grunwald, P., De Heide, R. & Koolen, W. M., 2020, arXiv, p. 1-47, 47 p.

    Research output: Working paper / PreprintPreprintAcademic

    Open Access

Cite this