Aggregation of Linked Data: A case study in the cultural heritage domain

Nuno Freire, Enno Meijers, Sjors De Valk, René Voorburg, Antoine Isaac, Roland Cornelissen

Research output: Chapter in Book / Report / Conference proceeding; Conference contribution; Academic; peer-review

Abstract

A very large number of online cultural heritage (CH) resources are made available through numerous digital libraries. To address the difficulties of discoverability in CH, the common practice is metadata aggregation, where centralized efforts like Europeana facilitate discoverability by collecting the resources' metadata. In recent years, the CH domain has invested in data models for Linked Data (LD) representation of CH metadata. LD, however, also has potential for innovating metadata aggregation. We present the results of a pilot case study within the Europeana Network. In this pilot, the National Library of The Netherlands plays the role of initial data provider, and the Dutch Digital Heritage Network plays the role of intermediary service, providing datasets to Europeana. We analysed the requirements for an LD aggregation solution and defined a workflow that fulfils the same functional requirements as Europeana's current solution. The workflow was put into practice within the pilot and led to the development of several software components for managing datasets, harvesting LD, data analysis and integration. Our analysis of the experience discusses the effort of adopting such an LD approach for data providers and aggregators, the expertise required by CH data analysts, and the supporting tools required for semantic data.

Original language: English
Title of host publication: Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018
Editors: Yang Song, Bing Liu, Kisung Lee, Naoki Abe, Calton Pu, Mu Qiao, Nesreen Ahmed, Donald Kossmann, Jeffrey Saltz, Jiliang Tang, Jingrui He, Huan Liu, Xiaohua Hu
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 522-527
Number of pages: 6
ISBN (Electronic): 9781538650356
DOIs: https://doi.org/10.1109/BigData.2018.8622348
Publication status: Published - 22 Jan 2019
Event: 2018 IEEE International Conference on Big Data, Big Data 2018 - Seattle, United States
Duration: 10 Dec 2018 to 13 Dec 2018

Conference

Conference: 2018 IEEE International Conference on Big Data, Big Data 2018
Country: United States
City: Seattle
Period: 10/12/18 to 13/12/18

Fingerprint

Metadata
Agglomeration
Digital libraries
Data integration
Data structures
Semantics

Keywords

  • Big Data variety
  • data aggregation
  • data analysis
  • datasets
  • RDF
  • semantics

Cite this

Freire, N., Meijers, E., De Valk, S., Voorburg, R., Isaac, A., & Cornelissen, R. (2019). Aggregation of Linked Data: A case study in the cultural heritage domain. In Y. Song, B. Liu, K. Lee, N. Abe, C. Pu, M. Qiao, N. Ahmed, D. Kossmann, J. Saltz, J. Tang, J. He, H. Liu, ... X. Hu (Eds.), Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018 (pp. 522-527). [8622348] Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/BigData.2018.8622348
Freire, Nuno ; Meijers, Enno ; De Valk, Sjors ; Voorburg, René ; Isaac, Antoine ; Cornelissen, Roland. / Aggregation of Linked Data : A case study in the cultural heritage domain. Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018. editor / Yang Song ; Bing Liu ; Kisung Lee ; Naoki Abe ; Calton Pu ; Mu Qiao ; Nesreen Ahmed ; Donald Kossmann ; Jeffrey Saltz ; Jiliang Tang ; Jingrui He ; Huan Liu ; Xiaohua Hu. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 522-527
@inproceedings{9bfd10753eb84bfc93e76c9ffcf6b381,
title = "Aggregation of Linked Data: A case study in the cultural heritage domain",
abstract = "A very large number of online cultural heritage (CH) resources is made available through numerous digital libraries. To address the difficulties of discoverability in CH, the common practice is metadata aggregation, where centralized efforts like Europeana facilitate discoverability by collecting the resources' metadata. In the last years, the CH domain has invested in data models for Linked Data (LD) representation of CH metadata. LD, however, also has potential for innovating metadata aggregation. We present the results of a pilot case study within the Europeana Network. In this pilot, the National Library of The Netherlands plays the role of initial data provider, with the Dutch Digital Heritage Network the one of intermediary service providing datasets to Europeana. We analysed the requirements for an LD aggregation solution and defined a workflow that fulfils the same functional requirements as Europeana's current solution. The workflow was put into practice within the pilot and led to the development of several software components for managing datasets, harvesting LD, data analysis and integration. Our analysis of the experience discusses the effort of adopting such an LD approach for data providers and aggregators, the expertise required by CH data analysts, and the supporting tools required for semantic data.",
keywords = "Big Data variety, data aggregation, data analysis, datasets, RDF, semantics",
author = "Nuno Freire and Enno Meijers and {De Valk}, Sjors and Ren{\'e} Voorburg and Antoine Isaac and Roland Cornelissen",
year = "2019",
month = "1",
day = "22",
doi = "10.1109/BigData.2018.8622348",
language = "English",
pages = "522--527",
editor = "Yang Song and Bing Liu and Kisung Lee and Naoki Abe and Calton Pu and Mu Qiao and Nesreen Ahmed and Donald Kossmann and Jeffrey Saltz and Jiliang Tang and Jingrui He and Huan Liu and Xiaohua Hu",
booktitle = "Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",
}

Freire, N, Meijers, E, De Valk, S, Voorburg, R, Isaac, A & Cornelissen, R 2019, Aggregation of Linked Data: A case study in the cultural heritage domain. in Y Song, B Liu, K Lee, N Abe, C Pu, M Qiao, N Ahmed, D Kossmann, J Saltz, J Tang, J He, H Liu & X Hu (eds), Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018, 8622348, Institute of Electrical and Electronics Engineers Inc., pp. 522-527, 2018 IEEE International Conference on Big Data, Big Data 2018, Seattle, United States, 10/12/18. https://doi.org/10.1109/BigData.2018.8622348

Aggregation of Linked Data : A case study in the cultural heritage domain. / Freire, Nuno; Meijers, Enno; De Valk, Sjors; Voorburg, René; Isaac, Antoine; Cornelissen, Roland.

Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018. ed. / Yang Song; Bing Liu; Kisung Lee; Naoki Abe; Calton Pu; Mu Qiao; Nesreen Ahmed; Donald Kossmann; Jeffrey Saltz; Jiliang Tang; Jingrui He; Huan Liu; Xiaohua Hu. Institute of Electrical and Electronics Engineers Inc., 2019. p. 522-527 8622348.

TY - GEN
T1 - Aggregation of Linked Data
T2 - A case study in the cultural heritage domain
AU - Freire, Nuno
AU - Meijers, Enno
AU - De Valk, Sjors
AU - Voorburg, René
AU - Isaac, Antoine
AU - Cornelissen, Roland
PY - 2019/1/22
Y1 - 2019/1/22
N2 - A very large number of online cultural heritage (CH) resources is made available through numerous digital libraries. To address the difficulties of discoverability in CH, the common practice is metadata aggregation, where centralized efforts like Europeana facilitate discoverability by collecting the resources' metadata. In the last years, the CH domain has invested in data models for Linked Data (LD) representation of CH metadata. LD, however, also has potential for innovating metadata aggregation. We present the results of a pilot case study within the Europeana Network. In this pilot, the National Library of The Netherlands plays the role of initial data provider, with the Dutch Digital Heritage Network the one of intermediary service providing datasets to Europeana. We analysed the requirements for an LD aggregation solution and defined a workflow that fulfils the same functional requirements as Europeana's current solution. The workflow was put into practice within the pilot and led to the development of several software components for managing datasets, harvesting LD, data analysis and integration. Our analysis of the experience discusses the effort of adopting such an LD approach for data providers and aggregators, the expertise required by CH data analysts, and the supporting tools required for semantic data.
AB - A very large number of online cultural heritage (CH) resources is made available through numerous digital libraries. To address the difficulties of discoverability in CH, the common practice is metadata aggregation, where centralized efforts like Europeana facilitate discoverability by collecting the resources' metadata. In the last years, the CH domain has invested in data models for Linked Data (LD) representation of CH metadata. LD, however, also has potential for innovating metadata aggregation. We present the results of a pilot case study within the Europeana Network. In this pilot, the National Library of The Netherlands plays the role of initial data provider, with the Dutch Digital Heritage Network the one of intermediary service providing datasets to Europeana. We analysed the requirements for an LD aggregation solution and defined a workflow that fulfils the same functional requirements as Europeana's current solution. The workflow was put into practice within the pilot and led to the development of several software components for managing datasets, harvesting LD, data analysis and integration. Our analysis of the experience discusses the effort of adopting such an LD approach for data providers and aggregators, the expertise required by CH data analysts, and the supporting tools required for semantic data.
KW - Big Data variety
KW - data aggregation
KW - data analysis
KW - datasets
KW - RDF
KW - semantics
UR - http://www.scopus.com/inward/record.url?scp=85062592997&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85062592997&partnerID=8YFLogxK
U2 - 10.1109/BigData.2018.8622348
DO - 10.1109/BigData.2018.8622348
M3 - Conference contribution
SP - 522
EP - 527
BT - Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018
A2 - Song, Yang
A2 - Liu, Bing
A2 - Lee, Kisung
A2 - Abe, Naoki
A2 - Pu, Calton
A2 - Qiao, Mu
A2 - Ahmed, Nesreen
A2 - Kossmann, Donald
A2 - Saltz, Jeffrey
A2 - Tang, Jiliang
A2 - He, Jingrui
A2 - Liu, Huan
A2 - Hu, Xiaohua
PB - Institute of Electrical and Electronics Engineers Inc.
ER -

Freire N, Meijers E, De Valk S, Voorburg R, Isaac A, Cornelissen R. Aggregation of Linked Data: A case study in the cultural heritage domain. In Song Y, Liu B, Lee K, Abe N, Pu C, Qiao M, Ahmed N, Kossmann D, Saltz J, Tang J, He J, Liu H, Hu X, editors, Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018. Institute of Electrical and Electronics Engineers Inc. 2019. p. 522-527. 8622348 https://doi.org/10.1109/BigData.2018.8622348