Validation of machine learning based scenario generators

Gero Junike et al.

Journal of Risk and Insurance2026https://doi.org/10.1111/jori.70040article
AJG 3ABDC A
Weight
0.50

Abstract

Machine learning (ML) methods are becoming increasingly important for designing economic scenario generators for internal models. Validating data‐driven models requires different methods than validating classical, theory‐based models. We discuss two novel aspects of such validation: first, checking the multivariate distribution of risk factors, and second, detecting unwanted memorization effects. The first task is necessary because, in ML‐based methods, dependencies are driven by data rather than derived from a financial‐mathematical theory. To address this first issue, we propose using an existing test from the literature. The second task is necessary because it cannot be ruled out that ML‐based models merely reproduce empirical data rather than generating new scenarios. For the second issue, we introduce a novel memorization ratio together with a thorough discussion. We include numerical experiments based on real market data and validate a simple autoencoder‐based scenario generator.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1111/jori.70040

Or copy a formatted citation

@article{gero2026,
  title        = {{Validation of machine learning based scenario generators}},
  author       = {Gero Junike et al.},
  journal      = {Journal of Risk and Insurance},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1111/jori.70040},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Validation of machine learning based scenario generators

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.