Correcting for Linkage Errors in Contingency Tables: A Cautionary Tale

Sander Scholtus, Natalie Shlomo, Ton De Waal

Research output: Contribution to journalArticlepeer-review

Abstract

Record linkage aims to bring records together from two or more files that belong to the same statistical entity. Naïvely treating a linked file as if there are no linkage errors may lead to biased inference. We present two general approaches for compensating for linkage error when calculating and analysing a two-way contingency table for categorical data, and study the following question: under what conditions can a compensation approach improve on the naïve approach, where linkage error is not compensated for? To this end, we compare estimation errors, bias, variance and mean square error for the naïve approach and two compensation approaches by means of an analytical study as well as a simulation study.
Original languageEnglish
Pages (from-to)122-137
Number of pages16
JournalJournal of Statistical Planning and Inference
Volume218
Early online date26 Oct 2021
DOIs
Publication statusPublished - 1 May 2022

Keywords

  • probabilistic record linkage,=
  • contingency table
  • exchangeable linkage error model
  • linkage error correction

Fingerprint

Dive into the research topics of 'Correcting for Linkage Errors in Contingency Tables: A Cautionary Tale'. Together they form a unique fingerprint.

Cite this