The dataLegend ecosystem for historical statistics

Rinke Hoekstra, Albert Meroño-Peñuela, Auke Rijpma, Richard Zijdeman, Ashkan Ashkpour, Kathrin Dentler, Ivo Zandhuis, Laurens Rietveld

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

The main promise of the digital humanities is the ability to perform scholarly studies at a much broader scale, and in a much more reusable fashion. The key enabler for such studies is the availability of sufficiently well described data. For the field of socio-economic history, data usually comes in a tabular form. Existing efforts to curate and publish datasets take a top-down approach and are focused on large collections, produce scarce metadata, require expertise for effective integration, provide poor user support while producing mappings, and present issues at data access. This paper presents the datalegend platform, which addresses the long tail of research data by catering for the needs of individual scholars. datalegend allows researchers to publish their (small) datasets, link them to existing vocabularies and other datasets, and thereby contribute to a growing collection of interlinked datasets. We present the architecture of datalegend; its core vocabularies and data; and QBer, an interactive, user supportive mapping generator and RDF converter. We evaluate our results by showing how our system facilitates use cases in socio-economic history.

Original languageEnglish
Pages (from-to)49-61
Number of pages13
JournalJournal of Web Semantics
Volume50
Early online date10 Mar 2018
DOIs
Publication statusPublished - May 2018

Fingerprint

Ecosystems
History
Statistics
Economics
Metadata
Availability

Keywords

  • Digital humanities
  • Linked data
  • QBer
  • Structured data

Cite this

Hoekstra, Rinke ; Meroño-Peñuela, Albert ; Rijpma, Auke ; Zijdeman, Richard ; Ashkpour, Ashkan ; Dentler, Kathrin ; Zandhuis, Ivo ; Rietveld, Laurens. / The dataLegend ecosystem for historical statistics. In: Journal of Web Semantics. 2018 ; Vol. 50. pp. 49-61.
@article{63132d99cd3b449085a58dfdea03cd39,
title = "The dataLegend ecosystem for historical statistics",
abstract = "The main promise of the digital humanities is the ability to perform scholarly studies at a much broader scale, and in a much more reusable fashion. The key enabler for such studies is the availability of sufficiently well described data. For the field of socio-economic history, data usually comes in a tabular form. Existing efforts to curate and publish datasets take a top-down approach and are focused on large collections, produce scarce metadata, require expertise for effective integration, provide poor user support while producing mappings, and present issues at data access. This paper presents the datalegend platform, which addresses the long tail of research data by catering for the needs of individual scholars. datalegend allows researchers to publish their (small) datasets, link them to existing vocabularies and other datasets, and thereby contribute to a growing collection of interlinked datasets. We present the architecture of datalegend; its core vocabularies and data; and QBer, an interactive, user supportive mapping generator and RDF converter. We evaluate our results by showing how our system facilitates use cases in socio-economic history.",
keywords = "Digital humanities, Linked data, QBer, Structured data",
author = "Rinke Hoekstra and Albert Mero{\~n}o-Pe{\~n}uela and Auke Rijpma and Richard Zijdeman and Ashkan Ashkpour and Kathrin Dentler and Ivo Zandhuis and Laurens Rietveld",
year = "2018",
month = "5",
doi = "10.1016/j.websem.2018.03.001",
language = "English",
volume = "50",
pages = "49--61",
journal = "Journal of Web Semantics",
issn = "1570-8268",
publisher = "Elsevier",

}

Hoekstra, R, Meroño-Peñuela, A, Rijpma, A, Zijdeman, R, Ashkpour, A, Dentler, K, Zandhuis, I & Rietveld, L 2018, 'The dataLegend ecosystem for historical statistics' Journal of Web Semantics, vol. 50, pp. 49-61. https://doi.org/10.1016/j.websem.2018.03.001

The dataLegend ecosystem for historical statistics. / Hoekstra, Rinke; Meroño-Peñuela, Albert; Rijpma, Auke; Zijdeman, Richard; Ashkpour, Ashkan; Dentler, Kathrin; Zandhuis, Ivo; Rietveld, Laurens.

In: Journal of Web Semantics, Vol. 50, 05.2018, p. 49-61.

Research output: Contribution to JournalArticleAcademicpeer-review

TY - JOUR

T1 - The dataLegend ecosystem for historical statistics

AU - Hoekstra, Rinke

AU - Meroño-Peñuela, Albert

AU - Rijpma, Auke

AU - Zijdeman, Richard

AU - Ashkpour, Ashkan

AU - Dentler, Kathrin

AU - Zandhuis, Ivo

AU - Rietveld, Laurens

PY - 2018/5

Y1 - 2018/5

N2 - The main promise of the digital humanities is the ability to perform scholarly studies at a much broader scale, and in a much more reusable fashion. The key enabler for such studies is the availability of sufficiently well described data. For the field of socio-economic history, data usually comes in a tabular form. Existing efforts to curate and publish datasets take a top-down approach and are focused on large collections, produce scarce metadata, require expertise for effective integration, provide poor user support while producing mappings, and present issues at data access. This paper presents the datalegend platform, which addresses the long tail of research data by catering for the needs of individual scholars. datalegend allows researchers to publish their (small) datasets, link them to existing vocabularies and other datasets, and thereby contribute to a growing collection of interlinked datasets. We present the architecture of datalegend; its core vocabularies and data; and QBer, an interactive, user supportive mapping generator and RDF converter. We evaluate our results by showing how our system facilitates use cases in socio-economic history.

AB - The main promise of the digital humanities is the ability to perform scholarly studies at a much broader scale, and in a much more reusable fashion. The key enabler for such studies is the availability of sufficiently well described data. For the field of socio-economic history, data usually comes in a tabular form. Existing efforts to curate and publish datasets take a top-down approach and are focused on large collections, produce scarce metadata, require expertise for effective integration, provide poor user support while producing mappings, and present issues at data access. This paper presents the datalegend platform, which addresses the long tail of research data by catering for the needs of individual scholars. datalegend allows researchers to publish their (small) datasets, link them to existing vocabularies and other datasets, and thereby contribute to a growing collection of interlinked datasets. We present the architecture of datalegend; its core vocabularies and data; and QBer, an interactive, user supportive mapping generator and RDF converter. We evaluate our results by showing how our system facilitates use cases in socio-economic history.

KW - Digital humanities

KW - Linked data

KW - QBer

KW - Structured data

UR - http://www.scopus.com/inward/record.url?scp=85044315945&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85044315945&partnerID=8YFLogxK

U2 - 10.1016/j.websem.2018.03.001

DO - 10.1016/j.websem.2018.03.001

M3 - Article

VL - 50

SP - 49

EP - 61

JO - Journal of Web Semantics

JF - Journal of Web Semantics

SN - 1570-8268

ER -