Causal Inference Without Complete Information: A Closed‐form Solution
Fernando Moreira
Abstract
Understanding the effect of certain factors or interventions is the objective of many researchers and decision‐makers. However, when using quantitative analyses for this purpose, it is virtually impossible to include all relevant data, which often leads to biased coefficients that only indicate correlation rather than effect. To help overcome this limitation, we introduce relatively simple formulas (closed‐form solutions) to estimate direct relationships between variables in the presence of unobserved factors (confounders) that simultaneously affect the two variables of interest. The estimated coefficient can be interpreted as causal when it is possible to rule out bidirectional relationships. Simulations show that our estimates match the unbiased coefficients up to 15 decimal places when the controls are independent of the omitted variables, significantly outperforming the respective ordinary least squares (OLS) estimates in all scenarios considered. We also find that, even with moderate correlation between the controls and the unobserved variables (up to 0.40), our method leads to estimates closer to the actual causal coefficients as compared to the OLS estimates in more than 70% of the cases. A real‐world dataset is used to illustrate the application of the method in the field of management.
Evidence weight
Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40
| F · citation impact | 0.50 × 0.4 = 0.20 |
| M · momentum | 0.50 × 0.15 = 0.07 |
| V · venue signal | 0.50 × 0.05 = 0.03 |
| R · text relevance † | 0.50 × 0.4 = 0.20 |
† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.