On linkage bias‐correction for estimators using iterated bootstraps

Siu‐Ming Tam et al.

International Statistical Review2026https://doi.org/10.1111/insr.70032article
AJG 3ABDC A
Weight
0.50

Abstract

By amalgamating data from disparate sources, the resulting integrated dataset becomes a valuable resource for statistical analysis. In probabilistic record linkage, the effectiveness of such integration relies on the availability of linkage variables free from errors. Where this is lacking, the linked data set would suffer from linkage errors and the resultant analyses, linkage bias. This paper proposes a methodology leveraging the bootstrap technique to devise linkage bias‐corrected estimators. Additionally, it introduces a test to assess whether increasing the number of bootstrap iterations meaningfully reduces linkage bias or merely inflates variance without further improving accuracy. An application of these methodologies is demonstrated through the analysis of a simulated dataset featuring hormone information, along with a dataset obtained from linking two data sets from the Australian Bureau of Statistics' labour mobility surveys.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1111/insr.70032

Or copy a formatted citation

@article{siu‐ming2026,
  title        = {{On linkage bias‐correction for estimators using iterated bootstraps}},
  author       = {Siu‐Ming Tam et al.},
  journal      = {International Statistical Review},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1111/insr.70032},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

On linkage bias‐correction for estimators using iterated bootstraps

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.