ChatGPT vs. Experts: Can GenAI Develop High-Quality Organizational and Policy Objectives?

Jay Simon & Johannes Ulrich Siebert

Decision Analysis2025https://doi.org/10.1287/deca.2025.0387article
AJG 1ABDC A
Weight
0.50

Abstract

This paper explores the efficacy of generative artificial intelligence (GenAI) for value-focused thinking, specifically the ability to generate high-quality sets of objectives for organizational or policy decisions. Overall, we find that most of the GenAI objectives are viable individually, but the sets as a whole are highly flawed. They often include nonessential considerations, omitting important ones. In addition, they are redundant and nondecomposable, often because of a tendency to include means objectives even when explicitly instructed not to. However, the sets of objectives can be improved by implementing best practices in prompting and with decision analysis (DA) expertise. The results provide further evidence of the importance of a human in the loop; in this case, GenAI tools are helpful for brainstorming objectives, but an expert with a background in decision analysis is needed before the results are used to support decision making. To facilitate this, a four-step approach incorporating the relative strengths of both GenAI and decision analysts is presented and demonstrated. History: This paper has been accepted for the Decision Analysis Special Issue on the Implications of Advances in Artificial Intelligence for Decision Analysis. Supplemental Material: The online appendix is available at https://doi.org/10.1287/deca.2025.0387 .

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1287/deca.2025.0387

Or copy a formatted citation

@article{jay2025,
  title        = {{ChatGPT vs. Experts: Can GenAI Develop High-Quality Organizational and Policy Objectives?}},
  author       = {Jay Simon & Johannes Ulrich Siebert},
  journal      = {Decision Analysis},
  year         = {2025},
  doi          = {https://doi.org/https://doi.org/10.1287/deca.2025.0387},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

ChatGPT vs. Experts: Can GenAI Develop High-Quality Organizational and Policy Objectives?

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.