Estimating Ratios of Means of Multicategory Data Observed with Sample and Category Perturbations

D S Clausen et al.

Biometrika2026https://doi.org/10.1093/biomet/asag009article
AJG 4ABDC A*
Weight
0.50

Abstract

Summay We consider the problem of estimating ratios of means of a multivariate outcome across covariates when the data are observed with unknown sample-specific and category-specific perturbations. Our model admits a partially identifiable estimand, and we establish full identifiability by imposing interpretable parameter constraints. To reduce bias and guarantee the existence of estimators in the presence of sparse observations, we apply an asymptotically negligible and constraint-invariant penalty to the loss function. We develop a fast coordinate-descent algorithm for estimation, and an augmented Lagrangian algorithm for estimation under null hypotheses. We construct a model-robust score test and demonstrate valid inference even for small sample sizes and under violated distributional assumptions. The flexibility of the approach, and comparisons with related methods, are illustrated through a simulation study and a meta-analysis of microbial associations with colorectal cancer.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1093/biomet/asag009

Or copy a formatted citation

@article{d2026,
  title        = {{Estimating Ratios of Means of Multicategory Data Observed with Sample and Category Perturbations}},
  author       = {D S Clausen et al.},
  journal      = {Biometrika},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1093/biomet/asag009},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Estimating Ratios of Means of Multicategory Data Observed with Sample and Category Perturbations

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.