Trials and Triangulations: Analyzing Aggregation Sensitivity in Event Data on Political Violence
Sebastian van Baalen & Kristine Höglund
Abstract
Event datasets are indispensable for advancing research and informing policy in international studies. However, the process by which researchers aggregate information from multiple reports into a single event record remains neglected. Responding to calls for greater data transparency, we emphasize the importance of compiling datasets at the event report level and employing systematic, replicable aggregation procedures. We argue that existing aggregation methods, such as relying on median or maximum values, are insufficient for handling complex datasets on international relations and political violence. To address this issue, we develop three distinct aggregation models tailored to more complicated data structures. Using novel event report-level data from the Modes and Agents of Election-Related Violence in Côte d’Ivoire and Kenya (MAVERICK) dataset, we conduct two illustrative analyses using pre-existing research designs to highlight the analytical benefits of assessing aggregation sensitivity. The study demonstrates that aggregation choices can have far-reaching consequences and underscores the need to account for aggregation sensitivity when collecting, selecting, and analyzing event data in international studies.
Evidence weight
Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40
| F · citation impact | 0.50 × 0.4 = 0.20 |
| M · momentum | 0.50 × 0.15 = 0.07 |
| V · venue signal | 0.50 × 0.05 = 0.03 |
| R · text relevance † | 0.50 × 0.4 = 0.20 |
† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.