Empirical Bayes large-scale multiple testing for high-dimensional binary outcome data
Yu-Chien Bo Ning
Abstract
This paper explores the multiple testing problem for sparse high-dimensional data with binary outcomes. We propose novel empirical Bayes multiple testing procedures based on a spike-and-slab posterior and then evaluate their performance in controlling the false discovery rate (FDR). A surprising finding is that the procedure using the default conjugate prior (namely, the ℓ-value procedure) can be overly conservative in estimating the FDR. To address this, we introduce two new procedures that provide accurate FDR control. Sharp frequentist theoretical results are established for these procedures, and numerical experiments are conducted to validate our theory in finite samples. To the best of our knowledge, we obtain the first uniform FDR control result in multiple testing for high-dimensional data with binary outcomes under the sparsity assumption.
Evidence weight
Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40
| F · citation impact | 0.50 × 0.4 = 0.20 |
| M · momentum | 0.50 × 0.15 = 0.07 |
| V · venue signal | 0.50 × 0.05 = 0.03 |
| R · text relevance † | 0.50 × 0.4 = 0.20 |
† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.