TY - JOUR
T1 - CEDAR: The Dutch Historical Censuses as Linked Open Data
AU - Meroño-Peñuela, Albert
AU - Ashkpour, Ashkan
AU - Guéret, Christophe
AU - Schlobach, Stefan
PY - 2017
Y1 - 2017
N2 - Here, we describe the CEDAR dataset, a five-star Linked Open Data representation of the Dutch historical censuses. These were conducted in the Netherlands once every 10 years from 1795 to 1971. We produce a linked dataset from a digitized sample of 2,288 tables. It contains more than 6.8 million statistical observations about the demography, labour and housing of Dutch society in the 18th, 19th and 20th centuries. The dataset is modeled using the RDF Data Cube, Open Annotation, and PROV vocabularies. These are used to represent the multidimensionality of the data, to express rules of data harmonization, and to keep track of the provenance of all data points and their transformations, respectively. We link observations within the dataset to well known standard classification systems in social history, such as the Historical International Standard Classification of Occupations (HISCO) and the Amsterdamse Code (AC). The three contributions of the dataset are (1) an easier access to integrated census data for historical researchers; (2) richer connections to related Linked Data resources; and (3) novel concept schemes of historical relevance, like classifications of historical religions and historical house types.
AB - Here, we describe the CEDAR dataset, a five-star Linked Open Data representation of the Dutch historical censuses. These were conducted in the Netherlands once every 10 years from 1795 to 1971. We produce a linked dataset from a digitized sample of 2,288 tables. It contains more than 6.8 million statistical observations about the demography, labour and housing of Dutch society in the 18th, 19th and 20th centuries. The dataset is modeled using the RDF Data Cube, Open Annotation, and PROV vocabularies. These are used to represent the multidimensionality of the data, to express rules of data harmonization, and to keep track of the provenance of all data points and their transformations, respectively. We link observations within the dataset to well known standard classification systems in social history, such as the Historical International Standard Classification of Occupations (HISCO) and the Amsterdamse Code (AC). The three contributions of the dataset are (1) an easier access to integrated census data for historical researchers; (2) richer connections to related Linked Data resources; and (3) novel concept schemes of historical relevance, like classifications of historical religions and historical house types.
KW - census data
KW - Linked Open Data
KW - RDF Data Cube
KW - Social history
UR - http://www.scopus.com/inward/record.url?scp=85004045046&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85004045046&partnerID=8YFLogxK
UR - https://semantic-web-journal.net/content/cedar-dutch-historical-censuses-linked-open-data-1
U2 - 10.3233/SW-160233
DO - 10.3233/SW-160233
M3 - Article
SN - 1570-0844
VL - 8
SP - 297
EP - 310
JO - Semantic Web
JF - Semantic Web
IS - 2
ER -