Trustworthiness in Computational Theory Construction: Dimensionalization and Category Surfacing1
Wendy Günther et al.
Abstract
In this methods article, we unpack how researchers can foster trustworthiness in dimensionalization and category surfacing (DCS), a key method family within the genre of computational theory construction (CTC). Information systems (IS), management, and organizational scholars are increasingly leveraging DCS tools such as topic modeling, word embeddings, and clustering to surface latent categories and dimensions from textual data for theory construction. Yet they struggle because evaluations of such research often default to transparency, operationalized as replicability and accountability, which obscures the analytical choices that actually make DCS research rigorous. In this study, we recast transparency as a means toward trustworthiness. We treat researchers’ analytical moves as the primary unit of methodological reasoning in how they design, conduct, and disclose their choices across research phases. We develop a framework that authors, reviewers, and editors can use to construct and evaluate DCS research. The framework specifies how trustworthiness arises from the interplay of two research design choices: primacy to theoretical versus practice lexicons, and whether the content of texts or the structure of the corpus carries the theoretical load. We articulate expectations for conduct and disclosure across these design choices, clarifying how proportionate reasoning anchors trustworthiness. We conclude with implications for advancing trustworthiness within the broader CTC community and across other computational approaches to research.
Evidence weight
Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40
| F · citation impact | 0.50 × 0.4 = 0.20 |
| M · momentum | 0.50 × 0.15 = 0.07 |
| V · venue signal | 0.50 × 0.05 = 0.03 |
| R · text relevance † | 0.50 × 0.4 = 0.20 |
† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.