The Impact of Latent Density Misspecification on Item Response Theory Equating Methods

Kyung Y. Kim et al.

Applied Psychological Measurement2026https://doi.org/10.1177/01466216261425440article
AJG 2ABDC B
Weight
0.50

Abstract

Item response theory (IRT) observed and true score equating are often conducted assuming that the latent variable is normally distributed. Although this might be a reasonable assumption for many educational and psychological assessments, not all variables can be approximated by a normal distribution. Under the common-item nonequivalent groups design, the current study examined the impact of latent density misspecification on IRT observed and true score equating. Specifically, equating results provided by two separate calibration estimates based on the Stocking-Lord linking method with normal and uniform weights and three concurrent calibration estimates obtained with different characterizations of the latent densities for the old and new groups were compared using both simulated and real data sets. In general, the concurrent calibration method with the latent densities for the two groups estimated using the empirical histogram method provided equating results with the least amount of error for most of the study conditions. Using normal weights with the Stocking-Lord method generally performed much better than using uniform weights; however, the overall performance of the Stocking-Lord method with normal weights was acceptable only if the latent densities for the two groups were normal distributions or close to normal distributions.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1177/01466216261425440

Or copy a formatted citation

@article{kyung2026,
  title        = {{The Impact of Latent Density Misspecification on Item Response Theory Equating Methods}},
  author       = {Kyung Y. Kim et al.},
  journal      = {Applied Psychological Measurement},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1177/01466216261425440},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

The Impact of Latent Density Misspecification on Item Response Theory Equating Methods

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.