TY - UNPB
T1 - Safe Testing
AU - Grunwald, P.
AU - De Heide, R.
AU - Koolen, W.M.
PY - 2023/3/10
Y1 - 2023/3/10
N2 - We develop the theory of hypothesis testing based on the e-value, a notion of evidencethat, unlike the p-value, allows for effortlessly combining results from several studies in thecommon scenario where the decision to perform a new study may depend on previous out-comes. Tests based on e-values are safe, i.e. they preserve Type-I error guarantees, undersuch optional continuation. We define growth-rate optimality (GRO) as an analogue ofpower in an optional continuation context, and we show how to construct GRO e-variablesfor general testing problems with composite null and alternative, emphasizing models withnuisance parameters. GRO e-values take the form of Bayes factors with special priors.We illustrate the theory using several classic examples including a one-sample safe t-testand the 2 × 2 contingency table. Sharing Fisherian, Neymanian and Jeffreys-Bayesianinterpretations, e-values may provide a methodology acceptable to adherents of all threeschools.
AB - We develop the theory of hypothesis testing based on the e-value, a notion of evidencethat, unlike the p-value, allows for effortlessly combining results from several studies in thecommon scenario where the decision to perform a new study may depend on previous out-comes. Tests based on e-values are safe, i.e. they preserve Type-I error guarantees, undersuch optional continuation. We define growth-rate optimality (GRO) as an analogue ofpower in an optional continuation context, and we show how to construct GRO e-variablesfor general testing problems with composite null and alternative, emphasizing models withnuisance parameters. GRO e-values take the form of Bayes factors with special priors.We illustrate the theory using several classic examples including a one-sample safe t-testand the 2 × 2 contingency table. Sharing Fisherian, Neymanian and Jeffreys-Bayesianinterpretations, e-values may provide a methodology acceptable to adherents of all threeschools.
U2 - 10.48550/arXiv.1906.07801
DO - 10.48550/arXiv.1906.07801
M3 - Preprint
SP - 1
EP - 47
BT - Safe Testing
PB - arXiv
ER -