Linking the Historical Sample of the Netherlands with the USA Censuses, 1850–1940

During the 19th and early 20th century about 220,000 Dutch born persons migrated to the USA. The Historical Sample of the Netherlands (HSN) contains about 85,500 persons born in the Netherlands between 1812 and 1922. In this article we report the way we have matched persons from the HSN with the Am...

Full description

Bibliographic Details
Main Authors: Diogo Paiva, Francisco Anguita, Kees Mandemakers
Format: Article
Language:English
Published: International Instititute of Social History 2020-09-01
Series:Historical Life Course Studies
Subjects:
Online Access:https://test.openjournals.nl/hlcs/article/view/9312
id doaj-701f347b83b2474ca3ed98d19ac9a734
record_format Article
spelling doaj-701f347b83b2474ca3ed98d19ac9a7342021-05-04T10:12:35ZengInternational Instititute of Social HistoryHistorical Life Course Studies2352-63432020-09-019Linking the Historical Sample of the Netherlands with the USA Censuses, 1850–1940Diogo PaivaFrancisco AnguitaKees Mandemakers During the 19th and early 20th century about 220,000 Dutch born persons migrated to the USA. The Historical Sample of the Netherlands (HSN) contains about 85,500 persons born in the Netherlands between 1812 and 1922. In this article we report the way we have matched persons from the HSN with the American censuses from the period 1850 till 1940. For this purpose, a linking process was designed, comprising of three stages: harmonization, matching and validation. The different nature of the two datasets (HSN and the USA Censuses) asked for some harmonization prior to the matching. Once the data had been properly prepared, two strategies were applied in order to link the data sets. The first one, called Similarity Approach, matched individuals from both datasets by comparing on the basis of resemblance of first and last names. The second approach, called Transformation Approach, made use of dictionaries with Anglicized versions of Dutch first and last names and their most common or most likely Dutch original(s). Because of the sample character of the HSN even exact matches showed ambiguity that needs to be resolved. For this reason, a validation process comparing the household context was run to provide a more trustworthy result. In the end we identified 484 individuals present in the HSN database with reliable links to the American censuses. We also evaluated the result in the light of what we know from emigration patterns to the USA over time and period and we concluded that our efforts have produced a reasonable result. Nevertheless, we are aware that we may have missed links. We also found that at least 45% of the emigrants returned to the Netherlands at some point during their life course. https://test.openjournals.nl/hlcs/article/view/9312Historical life coursesNominal record matchingEmigrationSocial historyHistorical demography
collection DOAJ
language English
format Article
sources DOAJ
author Diogo Paiva
Francisco Anguita
Kees Mandemakers
spellingShingle Diogo Paiva
Francisco Anguita
Kees Mandemakers
Linking the Historical Sample of the Netherlands with the USA Censuses, 1850–1940
Historical Life Course Studies
Historical life courses
Nominal record matching
Emigration
Social history
Historical demography
author_facet Diogo Paiva
Francisco Anguita
Kees Mandemakers
author_sort Diogo Paiva
title Linking the Historical Sample of the Netherlands with the USA Censuses, 1850–1940
title_short Linking the Historical Sample of the Netherlands with the USA Censuses, 1850–1940
title_full Linking the Historical Sample of the Netherlands with the USA Censuses, 1850–1940
title_fullStr Linking the Historical Sample of the Netherlands with the USA Censuses, 1850–1940
title_full_unstemmed Linking the Historical Sample of the Netherlands with the USA Censuses, 1850–1940
title_sort linking the historical sample of the netherlands with the usa censuses, 1850–1940
publisher International Instititute of Social History
series Historical Life Course Studies
issn 2352-6343
publishDate 2020-09-01
description During the 19th and early 20th century about 220,000 Dutch born persons migrated to the USA. The Historical Sample of the Netherlands (HSN) contains about 85,500 persons born in the Netherlands between 1812 and 1922. In this article we report the way we have matched persons from the HSN with the American censuses from the period 1850 till 1940. For this purpose, a linking process was designed, comprising of three stages: harmonization, matching and validation. The different nature of the two datasets (HSN and the USA Censuses) asked for some harmonization prior to the matching. Once the data had been properly prepared, two strategies were applied in order to link the data sets. The first one, called Similarity Approach, matched individuals from both datasets by comparing on the basis of resemblance of first and last names. The second approach, called Transformation Approach, made use of dictionaries with Anglicized versions of Dutch first and last names and their most common or most likely Dutch original(s). Because of the sample character of the HSN even exact matches showed ambiguity that needs to be resolved. For this reason, a validation process comparing the household context was run to provide a more trustworthy result. In the end we identified 484 individuals present in the HSN database with reliable links to the American censuses. We also evaluated the result in the light of what we know from emigration patterns to the USA over time and period and we concluded that our efforts have produced a reasonable result. Nevertheless, we are aware that we may have missed links. We also found that at least 45% of the emigrants returned to the Netherlands at some point during their life course.
topic Historical life courses
Nominal record matching
Emigration
Social history
Historical demography
url https://test.openjournals.nl/hlcs/article/view/9312
work_keys_str_mv AT diogopaiva linkingthehistoricalsampleofthenetherlandswiththeusacensuses18501940
AT franciscoanguita linkingthehistoricalsampleofthenetherlandswiththeusacensuses18501940
AT keesmandemakers linkingthehistoricalsampleofthenetherlandswiththeusacensuses18501940
_version_ 1721479542959243264