From point to probabilistic gradient boosting for claim frequency and severity prediction

Dominik Chevalier & Marie‐Pier Côté

European Actuarial Journal · 2025 · https://doi.org/10.1007/s13385-025-00428-5 · article
AJG 2 · ABDC B
Weight
0.44

Abstract

Gradient boosting decision tree algorithms are increasingly used in actuarial applications because they show superior predictive performance over traditional generalised linear models. Many enhancements to the first gradient boosting machine algorithm exist. We present in a unified notation, and contrast, all the existing point and probabilistic gradient boosting decision tree algorithms: GBM, XGBoost, DART, LightGBM, CatBoost, EGBM, PGBM, XGBoostLSS, cyclic GBM, and NGBoost. In a comprehensive numerical study, we compare their performance on five publicly available datasets for claim frequency and severity, of various sizes and comprising different numbers of (high-cardinality) categorical variables. We explain how varying exposure-to-risk can be handled with boosting in frequency models. We compare the algorithms on the basis of computational efficiency, predictive performance, and model adequacy. LightGBM and XGBoostLSS win in terms of computational efficiency. CatBoost sometimes improves predictive performance, especially in the presence of high-cardinality categorical variables, which are common in actuarial science. The fully interpretable EGBM achieves predictive performance competitive with the black-box algorithms considered. We find that there is no trade-off between model adequacy and predictive accuracy: both are achievable simultaneously.
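The abstract does not say how exposure-to-risk enters the frequency models. One common approach, assumed here and not confirmed as the authors' method, is a Poisson log-link with log-exposure as an offset, so that the expected claim count is exposure × exp(score). The sketch below computes the gradient and Hessian of the Poisson negative log-likelihood with respect to the raw score, which are the quantities a second-order boosting step would consume. The function names and the toy data are hypothetical.

```python
import math

def initial_score(y, exposure):
    # Constant raw score that matches the overall claim rate:
    # exp(f0) = total claims / total exposure.
    return math.log(sum(y) / sum(exposure))

def poisson_offset_grad_hess(y, exposure, score):
    # Assumed model: mu_i = exposure_i * exp(score_i), i.e. log(exposure) is an offset.
    # Per-observation negative log-likelihood (up to a constant): mu_i - y_i * log(mu_i).
    grads, hess = [], []
    for yi, ei, fi in zip(y, exposure, score):
        mu = ei * math.exp(fi)
        grads.append(mu - yi)  # first derivative w.r.t. score_i
        hess.append(mu)        # second derivative w.r.t. score_i
    return grads, hess

# Toy portfolio (made-up numbers): claim counts and policy-years of exposure.
y = [0, 1, 0, 2]
exposure = [0.5, 1.0, 0.25, 1.0]

f0 = initial_score(y, exposure)
grads, hess = poisson_offset_grad_hess(y, exposure, [f0] * len(y))
```

At the initial score the gradients sum to zero, because the fitted total of expected claims equals the observed total. Policies with short exposure contribute proportionally smaller expected counts, rather than being treated as full-year risks.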

3 citations


Cite this paper

https://doi.org/10.1007/s13385-025-00428-5

Or copy a formatted citation

@article{dominik2025,
  title        = {{From point to probabilistic gradient boosting for claim frequency and severity prediction}},
  author       = {Dominik Chevalier and Marie-Pier Côté},
  journal      = {European Actuarial Journal},
  year         = {2025},
  doi          = {10.1007/s13385-025-00428-5},
}

Paste directly into BibTeX, Zotero, or your reference manager.



Evidence weight

0.44

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact: 0.32 × 0.40 = 0.13
M · momentum: 0.57 × 0.15 = 0.09
V · venue signal: 0.50 × 0.05 = 0.03
R · text relevance †: 0.50 × 0.40 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.
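
The breakdown above reads as a weighted sum of four component scores. A minimal sketch of that arithmetic follows, assuming the evidence weight is simply the sum of score × mode weight over the four components; the function name is hypothetical and the page does not document the exact formula or rounding.

```python
# Balanced-mode weights and component scores as listed on this page.
mode_weights = {"F": 0.40, "M": 0.15, "V": 0.05, "R": 0.40}
scores = {"F": 0.32, "M": 0.57, "V": 0.50, "R": 0.50}

def evidence_weight(scores, mode_weights):
    # Assumed formula: weighted sum of the component scores.
    return sum(scores[k] * mode_weights[k] for k in mode_weights)

total = evidence_weight(scores, mode_weights)
print(round(total, 2))
```

Under this assumption the unrounded total is 0.4385, which rounds to the displayed 0.44. The rounded per-component contributions (0.13, 0.09, 0.03, 0.20) add up to 0.45, so the displayed total appears to be rounded after summing rather than before.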