Optimal Item Calibration in the Context of the Swedish Scholastic Aptitude Test

Jonas Bjermo et al.

Applied Psychological Measurement2026https://doi.org/10.1177/01466216261420758article
AJG 2ABDC B
Weight
0.50

Abstract

Large-scale achievement tests require the existence of item banks with items for use in future tests. Before an item is included into the bank, its characteristics need to be estimated. The process of estimating the item characteristics is called item calibration. For the quality of the future achievement tests, it is important to perform this calibration well and it is desirable to estimate the item characteristics as efficiently as possible. Methods of optimal design have been developed to allocate pretest items to examinees with the most suited ability. Theoretical evidence shows advantages with using ability-dependent allocation of pretest items. However, it is not clear whether these theoretical results hold also in a real testing situation. In this paper, we investigate the performance of an optimal ability-dependent allocation in the context of the Swedish Scholastic Aptitude Test (SweSAT) and quantify the gain from using the optimal allocation. On average over all items, we see an improved precision of calibration. While this average improvement is moderate, we are able to identify for what kind of items the method works well. This enables targeting specific item types for optimal calibration. We also discuss possibilities for improvements of the method.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1177/01466216261420758

Or copy a formatted citation

@article{jonas2026,
  title        = {{Optimal Item Calibration in the Context of the Swedish Scholastic Aptitude Test}},
  author       = {Jonas Bjermo et al.},
  journal      = {Applied Psychological Measurement},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1177/01466216261420758},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Optimal Item Calibration in the Context of the Swedish Scholastic Aptitude Test

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.