A Random‐Effects Approach to Regression Involving Many Categorical Predictors and Their Interactions

Hanmei Sun et al.

Australian and New Zealand Journal of Statistics2026https://doi.org/10.1111/anzs.70034article
ABDC A
Weight
0.50

Abstract

Linear model prediction with a large number of potential predictors is both statistically and computationally challenging. The traditional approaches are largely based on shrinkage selection/estimation methods, which are applicable even when the number of potential predictors is (much) larger than the sample size. A situation of the latter scenario occurs when the candidate predictors involve many binary indicators corresponding to categories of some categorical predictors as well as their interactions. We propose an alternative approach to the shrinkage prediction methods in such a case based on mixed model prediction, which effectively treats combinations of the categorical effects as random effects. We establish theoretical validity of the proposed method and demonstrate empirically its advantage over the shrinkage methods. We also develop measures of uncertainty for the proposed method and evaluate their performance empirically. A real‐data example is considered.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1111/anzs.70034

Or copy a formatted citation

@article{hanmei2026,
  title        = {{A Random‐Effects Approach to Regression Involving Many Categorical Predictors and Their Interactions}},
  author       = {Hanmei Sun et al.},
  journal      = {Australian and New Zealand Journal of Statistics},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1111/anzs.70034},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

A Random‐Effects Approach to Regression Involving Many Categorical Predictors and Their Interactions

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.