Berry–Esseen bounds for design-based causal inference with possibly diverging treatment levels and varying group sizes

Lei Shi & Peng Ding

Annals of Statistics2026https://doi.org/10.1214/25-aos2569article
AJG 4*ABDC A*
Weight
0.50

Abstract

Neyman (1923/1990) introduced the randomization model, which contains the notation of potential outcomes to define causal effects and a framework for large-sample inference based on the design of the experiment. However, the existing theory for this framework is far from complete, especially when the number of treatment levels diverges and the treatment group sizes vary. We provide a unified discussion of statistical inference under the randomization model with general treatment group sizes. We formulate the estimator in terms of a linear permutation statistic and use results based on Stein’s method to derive various Berry–Esseen bounds on the linear and quadratic functions of the estimator. These new Berry–Esseen bounds serve as the basis for design-based causal inference with possibly diverging treatment levels and a diverging number of causal parameters of interest. We also fill an important gap by proposing novel variance estimators for experiments with possibly many treatment levels without replications. Equipped with the newly developed results, design-based causal inference in general settings becomes more convenient with stronger theoretical guarantees.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1214/25-aos2569

Or copy a formatted citation

@article{lei2026,
  title        = {{Berry–Esseen bounds for design-based causal inference with possibly diverging treatment levels and varying group sizes}},
  author       = {Lei Shi & Peng Ding},
  journal      = {Annals of Statistics},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1214/25-aos2569},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Berry–Esseen bounds for design-based causal inference with possibly diverging treatment levels and varying group sizes

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.