Sovereign credit rating classification: evaluating the accuracy and driving factors using machine learning techniques
Fábio Henrique Sousa Coelho et al.
Abstract
Purpose: This paper explores the effectiveness of machine learning algorithms in predicting sovereign credit ratings, benchmarking their performance against traditional linear and panel regression models.
Design/methodology/approach: The analysis uses data from 41 countries between 2000 and 2018, with macroeconomic, political, and institutional variables as predictors. Supervised learning techniques, including Random Forest, Neural Networks, XGBoost, and K-Nearest Neighbors (KNN), are employed in conjunction with feature selection methods such as Lasso and Ridge.
Findings: The results show that machine learning models, particularly Random Forest, deliver superior predictive accuracy, outperforming even fixed-effects panel models. Random Forest correctly classified slightly over 40% of the ratings, compared with just over 30% for the second-best model.
Originality/value: These findings underscore the flexibility and predictive strength of machine learning approaches, which operate with fewer assumptions and capture complex interactions and nonlinear relationships among variables. The study also reaffirms the central role of institutional quality and political stability in determining sovereign credit ratings, and contributes to the expanding use of computational tools in economics, particularly for classification and forecasting tasks.
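To make the classification setup and the accuracy figures above more concrete, the following is a minimal sketch of one of the classifiers named in the abstract (KNN), together with the exact-match accuracy measure that a statement such as "slightly over 40% of the ratings" implies. It is not the authors' implementation: the two features, the rating labels, the data points, the choice of k = 3, and the use of Euclidean distance are all assumptions made purely for illustration.

```python
from collections import Counter
from math import dist


def knn_predict(train_X, train_y, x, k=3):
    """Predict a rating for x by majority vote among its k nearest training points."""
    # Rank training observations by Euclidean distance to x and keep the k closest.
    nearest = sorted(range(len(train_X)), key=lambda i: dist(train_X[i], x))[:k]
    votes = Counter(train_y[i] for i in nearest)
    return votes.most_common(1)[0][0]


def accuracy(y_true, y_pred):
    """Share of observations whose predicted rating exactly matches the true rating."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Invented toy data: each tuple is (institutional-quality score, debt burden),
# both imagined as standardized values. None of this comes from the paper.
train_X = [
    (1.5, -1.0), (1.2, -0.8), (1.4, -1.1),   # labelled "AAA"
    (0.1, 0.0), (-0.1, 0.2), (0.0, 0.1),     # labelled "BBB"
    (-1.3, 1.1), (-1.5, 0.9), (-1.2, 1.2),   # labelled "B"
]
train_y = ["AAA"] * 3 + ["BBB"] * 3 + ["B"] * 3

test_X = [(1.3, -0.9), (0.05, 0.05), (-1.4, 1.0), (0.6, -0.4)]
test_y = ["AAA", "BBB", "B", "AAA"]

predictions = [knn_predict(train_X, train_y, x) for x in test_X]
print(predictions)                    # ['AAA', 'BBB', 'B', 'BBB']
print(accuracy(test_y, predictions))  # 0.75
```

In this toy example the last test point lies between two rating clusters and is misclassified, so exact-match accuracy is 3 out of 4. Exact-match accuracy treats every miss equally, whether the prediction is one notch or many notches away from the true rating.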