CEDAR: The Dutch Historical Censuses as Linked Open Data

Albert Meroño-Peñuela, Ashkan Ashkpour, Christophe Guéret, Stefan Schlobach

Research output: Contribution to JournalArticleAcademicpeer-review

316 Downloads (Pure)

Abstract

Here, we describe the CEDAR dataset, a five-star Linked Open Data representation of the Dutch historical censuses. These were conducted in the Netherlands once every 10 years from 1795 to 1971. We produce a linked dataset from a digitized sample of 2,288 tables. It contains more than 6.8 million statistical observations about the demography, labour and housing of Dutch society in the 18th, 19th and 20th centuries. The dataset is modeled using the RDF Data Cube, Open Annotation, and PROV vocabularies. These are used to represent the multidimensionality of the data, to express rules of data harmonization, and to keep track of the provenance of all data points and their transformations, respectively. We link observations within the dataset to well known standard classification systems in social history, such as the Historical International Standard Classification of Occupations (HISCO) and the Amsterdamse Code (AC). The three contributions of the dataset are (1) an easier access to integrated census data for historical researchers; (2) richer connections to related Linked Data resources; and (3) novel concept schemes of historical relevance, like classifications of historical religions and historical house types.
Original languageEnglish
Pages (from-to)297-310
Number of pages14
JournalSemantic Web
Volume8
Issue number2
Early online date6 Dec 2016
DOIs
Publication statusPublished - 2017

Keywords

  • census data
  • Linked Open Data
  • RDF Data Cube
  • Social history

Fingerprint

Dive into the research topics of 'CEDAR: The Dutch Historical Censuses as Linked Open Data'. Together they form a unique fingerprint.

Cite this