Operating characteristics of Bayes factors

Fulvio De Santis et al.

Journal of the Royal Statistical Society. Series A: Statistics in Society2026https://doi.org/10.1093/jrsssa/qnag044article
AJG 3
Weight
0.50

Abstract

Bayes factors are Bayesian test statistics whose numerical post-experimental values quantify the evidence provided by an experiment in favour of a statistical hypothesis with respect to an alternative hypothesis. Their use in clinical trials has been often advocated, conditional to explicit control of their frequentist operating characteristics, namely type-one error and power. In this article we revisit and further explore the formal relationships between Bayes factors and standard optimal frequentist testing procedures for both one-sided and two-sided hypotheses on a normal mean, frameworks that are widely used in experimental contexts. With regard to tests based on Bayes factors, these relationships allow one: (i) to prove frequentist optimality and to highlight differences between the one-sided and the two-sided cases; (ii) to obtain explicit formulas to compute type-one error and power from those of the related frequentist test; and (iii) to determine the expression of the so-called probability of success from that of the frequentist test. We illustrate the main ideas in several numerical examples and in an application on the use of mirtazapine in dementia patients, where the sample size determination problem is also considered.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1093/jrsssa/qnag044

Or copy a formatted citation

@article{fulvio2026,
  title        = {{Operating characteristics of Bayes factors}},
  author       = {Fulvio De Santis et al.},
  journal      = {Journal of the Royal Statistical Society. Series A: Statistics in Society},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1093/jrsssa/qnag044},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Operating characteristics of Bayes factors

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.