Learning in random utility models via online decision problems

Emerson Melo

International Journal of Economic Theory2025https://doi.org/10.1111/ijet.70006article
AJG 2ABDC B
Weight
0.50

Abstract

This paper examines the Random Utility Model (RUM) in repeated stochastic choice settings where decision‐makers lack full information about payoffs. We propose a gradient‐based learning algorithm that embeds RUM into an online decision‐making framework. Our analysis establishes Hannan consistency for a broad class of RUMs, meaning the average regret relative to the best fixed action in hindsight vanishes over time. We also show that our algorithm is equivalent to the Follow‐The‐Regularized‐Leader method, offering an economically grounded approach to online optimization. Applications include modeling recency bias and characterizing coarse correlated equilibria in normal‐form games.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1111/ijet.70006

Or copy a formatted citation

@article{emerson2025,
  title        = {{Learning in random utility models via online decision problems}},
  author       = {Emerson Melo},
  journal      = {International Journal of Economic Theory},
  year         = {2025},
  doi          = {https://doi.org/https://doi.org/10.1111/ijet.70006},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Learning in random utility models via online decision problems

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.