Finding Distributions that Differ, with False Discovery Rate Control

Yonghoon Lee et al.

Biometrika2026https://doi.org/10.1093/biomet/asag025article
AJG 4ABDC A*
Weight
0.50

Abstract

Summary We consider the problem of comparing a reference distribution with several other distributions. Given a sample from both the reference and the comparison groups, we aim to identify the comparison groups whose distributions differ from that of the reference group. Viewing this as a multiple-testing problem, we introduce a methodology that provides exact, distribution-free control of the false discovery rate. To do so, we introduce the concept of batch conformal p-values and demonstrate that they satisfy positive regression dependence across the groups (Benjamini & Yekutieli, 2001), thereby enabling control of the false discovery rate through the Benjamini–Hochberg procedure. The proof of positive regression dependence introduces a novel technique for the inductive construction of rank vectors with almost-sure dominance under exchangeability. We evaluate the performance of the proposed procedure through simulations. Despite being distribution-free, in some cases it shows performance comparable to methods with knowledge of the data-generating normal distribution, and it further has more power than direct approaches based on conformal out-of-distribution detection. Furthermore, we illustrate our methods on a hepatitis C treatment dataset, where they identify patient groups with large treatment effects, and on the Current Population Survey dataset, where they identify subpopulations with long working hours.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1093/biomet/asag025

Or copy a formatted citation

@article{yonghoon2026,
  title        = {{Finding Distributions that Differ, with False Discovery Rate Control}},
  author       = {Yonghoon Lee et al.},
  journal      = {Biometrika},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1093/biomet/asag025},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Finding Distributions that Differ, with False Discovery Rate Control

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.