Empirical Bayes large-scale multiple testing for high-dimensional binary outcome data

Yu-Chien Bo Ning

Bernoulli2026https://doi.org/10.3150/25-bej1904article
ABDC A
Weight
0.50

Abstract

This paper explores the multiple testing problem for sparse high-dimensional data with binary outcomes. We propose novel empirical Bayes multiple testing procedures based on a spike-and-slab posterior and then evaluate their performance in controlling the false discovery rate (FDR). A surprising finding is that the procedure using the default conjugate prior (namely, the ℓ-value procedure) can be overly conservative in estimating the FDR. To address this, we introduce two new procedures that provide accurate FDR control. Sharp frequentist theoretical results are established for these procedures, and numerical experiments are conducted to validate our theory in finite samples. To the best of our knowledge, we obtain the first uniform FDR control result in multiple testing for high-dimensional data with binary outcomes under the sparsity assumption.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.3150/25-bej1904

Or copy a formatted citation

@article{yu-chien2026,
  title        = {{Empirical Bayes large-scale multiple testing for high-dimensional binary outcome data}},
  author       = {Yu-Chien Bo Ning},
  journal      = {Bernoulli},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.3150/25-bej1904},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Empirical Bayes large-scale multiple testing for high-dimensional binary outcome data

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.