Double machine learning for static panel models with fixed effects
Paul Clarke & Annalivia Polselli
Abstract
Summary Recent advances in causal inference have seen the development of methods that make use of the predictive power of machine learning algorithms. In this paper, we develop novel double machine learning procedures for panel data in which these algorithms are used to approximate high-dimensional and non-linear nuisance functions of the covariates. Our new procedures are extensions of the well-known correlated random effects, within-group, and first-difference estimators from linear to non-linear panel models, specifically, the partially linear regression model with fixed effects and unspecified non-linear confounding. Our simulation study assesses the performance of these procedures using different machine learning algorithms. We use our procedures to re-estimate the impact of the introduction of the National Minimum Wage on voting behaviour in the United Kingdom. From our results, we recommend the use of first-differencing because it imposes the fewest constraints on the distribution of the fixed effects, and an ensemble learning strategy to ensure optimum estimator accuracy.
4 citations
Evidence weight
Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40
| F · citation impact | 0.37 × 0.4 = 0.15 |
| M · momentum | 0.60 × 0.15 = 0.09 |
| V · venue signal | 0.50 × 0.05 = 0.03 |
| R · text relevance † | 0.50 × 0.4 = 0.20 |
† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.