This paper examines the Random Utility Model (RUM) in repeated stochastic choice settings where decision‐makers lack full information about payoffs. We propose a gradient‐based learning algorithm that embeds RUM into an online decision‐making framework. Our analysis establishes Hannan consistency for a broad class of RUMs, meaning the average regret relative to the best fixed action in hindsight vanishes over time. We also show that our algorithm is equivalent to the Follow‐The‐Regularized‐Leader method, offering an economically grounded approach to online optimization. Applications include modeling recency bias and characterizing coarse correlated equilibria in normal‐form games.
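The connection claimed above can be illustrated with a minimal sketch (not the paper's actual algorithm): FTRL with an entropic regularizer selects actions via a softmax over cumulative payoffs, which coincides with the logit RUM (i.i.d. Gumbel utility shocks), and its average regret against the best fixed action shrinks as the horizon grows. All parameter choices below (5 actions, horizon 2000, the standard learning rate) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n_actions, T = 5, 2000                      # illustrative problem size
eta = np.sqrt(np.log(n_actions) / T)        # standard FTRL learning rate

losses = rng.uniform(size=(T, n_actions))   # a fixed sequence of loss vectors
cum_loss = np.zeros(n_actions)
alg_loss = 0.0

for t in range(T):
    # FTRL with entropic regularizer: softmax over cumulative payoffs.
    # This choice rule is exactly the logit RUM (Gumbel-perturbed utilities).
    w = np.exp(-eta * cum_loss)
    p = w / w.sum()
    alg_loss += p @ losses[t]               # expected loss of the mixed choice
    cum_loss += losses[t]

best_fixed = cum_loss.min()                 # best fixed action in hindsight
avg_regret = (alg_loss - best_fixed) / T    # should be small, -> 0 as T grows
```

Hannan consistency here means `avg_regret` tends to zero as `T` increases; the sketch only demonstrates that it is already small at a modest horizon.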