Anytime validity is free: inducing sequential tests

Nick W. Koning & Sam van Meer

Journal of the Royal Statistical Society. Series B: Statistical Methodology2026https://doi.org/10.1093/jrsssb/qkag050article
AJG 4
Weight
0.50

Abstract

Anytime valid sequential tests permit us to stop testing based on the current data, without invalidating the inference. Given a maximum number of observations N, one may believe this must come at the cost of power when compared to a conventional test that waits until all N observations have arrived. Our first contribution is to show that this is false: for any valid test based on N observations, we show how to construct an anytime valid sequential test that matches it after N observations. Our second contribution is that we may use the outcome of a [0,1]-valued test as a conditional significance level in subsequent testing, leading to an overall procedure that is valid at the original significance level. This shows that anytime validity and optional continuation are readily available in traditional testing, without requiring explicit use of e-values. We illustrate this by deriving the anytime valid sequentialized z-test and t-test, which at time N coincide with the traditional z-test and t-test. Finally, we characterize the Sequential Probability Ratio Test by invariance under test induction, and show that it is induced by the Neyman–Pearson test for a tiny level and huge N under an i.i.d. assumption.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1093/jrsssb/qkag050

Or copy a formatted citation

@article{nick2026,
  title        = {{Anytime validity is free: inducing sequential tests}},
  author       = {Nick W. Koning & Sam van Meer},
  journal      = {Journal of the Royal Statistical Society. Series B: Statistical Methodology},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1093/jrsssb/qkag050},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Anytime validity is free: inducing sequential tests

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.