Playing with matches: An assessment of accuracy in linked historical data |
| |
Authors: | Catherine G Massey |
| |
Institution: | Population Studies Center, Institute for Social Research, University of Michigan |
| |
Abstract: | This article evaluates linkage quality achieved by various record linkage techniques used in historical demography. The author creates benchmark, or truth, data by linking the 2005 Current Population Survey Annual Social and Economic Supplement to the Social Security Administration's numeric identification system by social security number. By comparing simulated linkages to the benchmark data, she examines the value added (in terms of number and quality of links) from incorporating text-string comparators, adjusting age, and using a probabilistic matching algorithm. She finds that text-string comparators and probabilistic approaches are useful for increasing the linkage rate, but use of text-string comparators may decrease accuracy in some cases. Overall, probabilistic matching offers the best balance between linkage rates and accuracy. |
| |
Keywords: | Census; historical demography; microdata; record linkage |