Using Coreference Resolution to Mitigate Measurement Error in Text Analysis
Farhan Iqbal & Michael D. Pfarrer
Abstract
Content analysis has enabled organizational scholars to study constructs and relationships that were previously impractical to examine at scale. One particular area of focus has been sentiment analysis, which scholars have used to examine myriad relationships pertinent to organizational research. This article addresses certain limitations in sentiment analysis. More specifically, we bring attention to the challenge of accurately attributing sentiment in text that mentions multiple firms. Whereas traditional methods often introduce measurement error by attributing text to the wrong firm, we offer coreference resolution (a natural language processing technique that identifies and links expressions referring to the same entity) as a solution to this problem. Across two studies, we demonstrate the potential of this approach to reduce measurement error and enhance the veracity of text analyses. We conclude by offering avenues for theoretical and empirical advances in organizational research.
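To make the attribution problem concrete, the following is a minimal sketch, not the authors' implementation. The sentences, the sentiment scores, and the coreference clusters are all hand-written assumptions; in practice the clusters would come from a coreference model and the scores from a sentiment tool. The document-level baseline is likewise an assumed stand-in for a traditional method. The sketch contrasts assigning one document-level score to every named firm with assigning each sentence's score only to the firm that the sentence refers to.

```python
# Toy illustration of sentiment attribution in a text that mentions two firms.
# All data below are hypothetical: the sentences, the scores, and the clusters
# are hand-written assumptions, not output from a real model or from the paper.

sentences = [
    "Acme acquired Beta last year.",
    "The acquirer has since posted record profits.",
    "The target, however, is facing lawsuits.",
]
sentiment = [0.0, 0.8, -0.7]  # assumed per-sentence sentiment scores

# Hypothetical coreference clusters: firm -> expressions that refer to it.
clusters = {
    "Acme": ["Acme", "The acquirer"],
    "Beta": ["Beta", "The target"],
}


def document_level(sentiment, firms):
    """Assumed baseline: every named firm receives the document's mean score."""
    mean = sum(sentiment) / len(sentiment)
    return {firm: mean for firm in firms}


def mention_level(sentences, sentiment, clusters):
    """Average only the sentences containing an expression linked to the firm."""
    result = {}
    for firm, mentions in clusters.items():
        scores = [
            score
            for text, score in zip(sentences, sentiment)
            if any(mention in text for mention in mentions)
        ]
        result[firm] = sum(scores) / len(scores) if scores else None
    return result


naive = document_level(sentiment, clusters.keys())
resolved = mention_level(sentences, sentiment, clusters)

print("document-level:", naive)
print("with coreference clusters:", resolved)
```

Under these assumptions, the baseline gives both firms the same score even though the text is favorable to one and unfavorable to the other, whereas the cluster-based version separates them. Simple substring matching is used here only for brevity; a real pipeline would match on token spans.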
1 citation