Reliable Granular References to Changing Linked Data

Tobias Kuhn, Egon Willighagen, Chris T. Evelo, Núria Queralt-Rosinach, Emilio Centeno, Laura I. Furlong

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Nanopublications are a concept to represent Linked Data in a granular and provenance-aware manner, which has been successfully applied to a number of scientific datasets. We demonstrated in previous work how we can establish reliable and verifiable identifiers for nanopublications and sets thereof. Further adoption of these techniques, however, was probably hindered by the fact that nanopublications can lead to an explosion in the number of triples due to auxiliary information about the structure of each nanopublication and repetitive provenance and metadata. We demonstrate here that this significant overhead disappears once we take the version history of nanopublication datasets into account, calculate incremental updates, and allow users to deal with the specific subsets they need. We show that the total size and overhead of evolving scientific datasets is reduced, and typical subsets that researchers use for their analyses can be referenced and retrieved efficiently with optimized precision, persistence, and reliability.
Original languageEnglish
Pages (from-to)436-451
JournalProceedings of the 16th International Semantic Web Conference (ISWC) 2017
DOIs
Publication statusPublished - 30 Aug 2017

Fingerprint

provenance
metadata
explosion
persistence
history
need

Bibliographical note

In: The Semantic Web – ISWC 2017; 16th International Semantic Web Conference
Vienna, Austria, October 21–25, 2017; Proceedings, Part I

Keywords

  • cs.DL

Cite this

Kuhn, Tobias ; Willighagen, Egon ; Evelo, Chris T. ; Queralt-Rosinach, Núria ; Centeno, Emilio ; Furlong, Laura I. / Reliable Granular References to Changing Linked Data. In: Proceedings of the 16th International Semantic Web Conference (ISWC) 2017. 2017 ; pp. 436-451.
@article{b2ef2b1b389b42a5b1f757a46699d588,
title = "Reliable Granular References to Changing Linked Data",
abstract = "Nanopublications are a concept to represent Linked Data in a granular and provenance-aware manner, which has been successfully applied to a number of scientific datasets. We demonstrated in previous work how we can establish reliable and verifiable identifiers for nanopublications and sets thereof. Further adoption of these techniques, however, was probably hindered by the fact that nanopublications can lead to an explosion in the number of triples due to auxiliary information about the structure of each nanopublication and repetitive provenance and metadata. We demonstrate here that this significant overhead disappears once we take the version history of nanopublication datasets into account, calculate incremental updates, and allow users to deal with the specific subsets they need. We show that the total size and overhead of evolving scientific datasets is reduced, and typical subsets that researchers use for their analyses can be referenced and retrieved efficiently with optimized precision, persistence, and reliability.",
keywords = "cs.DL",
author = "Tobias Kuhn and Egon Willighagen and Evelo, {Chris T.} and N{\'u}ria Queralt-Rosinach and Emilio Centeno and Furlong, {Laura I.}",
note = "In: The Semantic Web – ISWC 2017; 16th International Semantic Web Conference Vienna, Austria, October 21–25, 2017; Proceedings, Part I",
year = "2017",
month = "8",
day = "30",
doi = "10.1007/978-3-319-68288-4_26",
language = "English",
pages = "436--451",
journal = "Proceedings of the 16th International Semantic Web Conference (ISWC) 2017",

}

Reliable Granular References to Changing Linked Data. / Kuhn, Tobias; Willighagen, Egon; Evelo, Chris T.; Queralt-Rosinach, Núria; Centeno, Emilio; Furlong, Laura I.

In: Proceedings of the 16th International Semantic Web Conference (ISWC) 2017, 30.08.2017, p. 436-451.

Research output: Contribution to JournalArticleAcademicpeer-review

TY - JOUR

T1 - Reliable Granular References to Changing Linked Data

AU - Kuhn, Tobias

AU - Willighagen, Egon

AU - Evelo, Chris T.

AU - Queralt-Rosinach, Núria

AU - Centeno, Emilio

AU - Furlong, Laura I.

N1 - In: The Semantic Web – ISWC 2017; 16th International Semantic Web Conference Vienna, Austria, October 21–25, 2017; Proceedings, Part I

PY - 2017/8/30

Y1 - 2017/8/30

N2 - Nanopublications are a concept to represent Linked Data in a granular and provenance-aware manner, which has been successfully applied to a number of scientific datasets. We demonstrated in previous work how we can establish reliable and verifiable identifiers for nanopublications and sets thereof. Further adoption of these techniques, however, was probably hindered by the fact that nanopublications can lead to an explosion in the number of triples due to auxiliary information about the structure of each nanopublication and repetitive provenance and metadata. We demonstrate here that this significant overhead disappears once we take the version history of nanopublication datasets into account, calculate incremental updates, and allow users to deal with the specific subsets they need. We show that the total size and overhead of evolving scientific datasets is reduced, and typical subsets that researchers use for their analyses can be referenced and retrieved efficiently with optimized precision, persistence, and reliability.

AB - Nanopublications are a concept to represent Linked Data in a granular and provenance-aware manner, which has been successfully applied to a number of scientific datasets. We demonstrated in previous work how we can establish reliable and verifiable identifiers for nanopublications and sets thereof. Further adoption of these techniques, however, was probably hindered by the fact that nanopublications can lead to an explosion in the number of triples due to auxiliary information about the structure of each nanopublication and repetitive provenance and metadata. We demonstrate here that this significant overhead disappears once we take the version history of nanopublication datasets into account, calculate incremental updates, and allow users to deal with the specific subsets they need. We show that the total size and overhead of evolving scientific datasets is reduced, and typical subsets that researchers use for their analyses can be referenced and retrieved efficiently with optimized precision, persistence, and reliability.

KW - cs.DL

U2 - 10.1007/978-3-319-68288-4_26

DO - 10.1007/978-3-319-68288-4_26

M3 - Article

SP - 436

EP - 451

JO - Proceedings of the 16th International Semantic Web Conference (ISWC) 2017

JF - Proceedings of the 16th International Semantic Web Conference (ISWC) 2017

ER -