Few and Different: Detecting Examinees With Preknowledge Using Extended Isolation Forests

Nathan Smith et al.

Applied Psychological Measurement2025https://doi.org/10.1177/01466216251320403article
AJG 2ABDC B
Weight
0.37

Abstract

Item preknowledge refers to the case where examinees have advanced knowledge of test material prior to taking the examination. When examinees have item preknowledge, the scores that result from those item responses are not true reflections of the examinee's proficiency. Further, this contamination in the data also has an impact on the item parameter estimates and therefore has an impact on scores for all examinees, regardless of whether they had prior knowledge. To ensure the validity of test scores, it is essential to identify both issues: compromised items (CIs) and examinees with preknowledge (EWPs). In some cases, the CIs are known, and the task is reduced to determining the EWPs. However, due to the potential threat to validity, it is critical for high-stakes testing programs to have a process for routinely monitoring for evidence of EWPs, often when CIs are unknown. Further, even knowing that specific items may have been compromised does not guarantee that any examinees had prior access to those items, or that those examinees that did have prior access know how to effectively use the preknowledge. Therefore, this paper attempts to use response behavior to identify item preknowledge without knowledge of which items may or may not have been compromised. While most research in this area has relied on traditional psychometric models, we investigate the utility of an unsupervised machine learning algorithm, extended isolation forest (EIF), to detect EWPs. Similar to previous research, the response behavior being analyzed is response time (RT) and response accuracy (RA).

1 citation

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1177/01466216251320403

Or copy a formatted citation

@article{nathan2025,
  title        = {{Few and Different: Detecting Examinees With Preknowledge Using Extended Isolation Forests}},
  author       = {Nathan Smith et al.},
  journal      = {Applied Psychological Measurement},
  year         = {2025},
  doi          = {https://doi.org/https://doi.org/10.1177/01466216251320403},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Few and Different: Detecting Examinees With Preknowledge Using Extended Isolation Forests

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.37

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.16 × 0.4 = 0.06
M · momentum0.53 × 0.15 = 0.08
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.