Dual evaluation of performance and fairness from machine learning models for non-life insurance pricing

Tarun Israni et al.

British Actuarial Journal2026https://doi.org/10.1017/s1357321725100317article
AJG 1ABDC B
Weight
0.50

Abstract

An increasing number of reports highlight the potential of machine learning (ML) methodologies over the conventional generalised linear model (GLM) for non-life insurance pricing. In parallel, national and international regulatory institutions are accentuating their focus on pricing fairness to quantify and mitigate algorithmic differences and discrimination. However, comprehensive studies that assess both pricing accuracy and fairness remain scarce. We propose a benchmark of the GLM against mainstream regularised linear models and tree-based ensemble models under two popular distribution modelling strategies (Poisson-gamma and Tweedie), with respect to key criteria including estimation bias, deviance, risk differentiation, competitiveness, loss ratios, discrimination and fairness. Pricing performance and fairness were assessed simultaneously on the same samples of premium estimates for GLM and ML models. The models were compared on two open-access motor insurance datasets, each with a different type of cover (fully comprehensive and third-party liability). While no single ML model outperformed across both pricing and discrimination metrics, the GLM significantly underperformed for most. The results indicate that ML may be considered a realistic and reasonable alternative to current practices. We advocate that benchmarking exercises for risk prediction models should be carried out to assess both pricing accuracy and fairness for any given portfolio.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1017/s1357321725100317

Or copy a formatted citation

@article{tarun2026,
  title        = {{Dual evaluation of performance and fairness from machine learning models for non-life insurance pricing}},
  author       = {Tarun Israni et al.},
  journal      = {British Actuarial Journal},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1017/s1357321725100317},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Dual evaluation of performance and fairness from machine learning models for non-life insurance pricing

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.