LOD lab: Scalable linked data processing

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

With tens if not hundreds of billions of logical statements, the Linked Open Data (LOD) is one of the biggest knowledge bases ever built. As such it is a gigantic source of information for applications in various domains, but also given its size an ideal test-bed for knowledge representation and reasoning, heterogeneous nature, and complexity. However, making use of this unique resource has proven next to impossible in the past due to a number of problems, including data collection, quality, accessibility, scalability, availability and findability. The LOD Laundromat and LOD Lab are recent infrastructures that addresses these problems in a systematic way, by automatically crawling, cleaning, indexing, analysing and republishing data in a unified way. Given a family of simple tools, LOD Lab allows researchers to query, access, analyse and manipulate hundreds of thousands of data documents seamlessly, e.g. facilitating experiments (e.g. for reasoning) over hundreds of thousands of (possibly integrated) datasets based on content and meta-data. This chapter provides the theoretical basis and practical skills required for making ideal use of this large scale experimental platform. First we study the problems that make it so hard to work with Semantic Web data in its current form. We’ll also propose generic solutions and introduce the tools the reader needs to get started with their own experiments on the LOD Cloud.

Original languageEnglish
Title of host publicationReasoning Web: Logical Foundation of Knowledge Graph Construction and Query Answering - 12th International Summer School 2016, Tutorial Lectures
PublisherSpringer/Verlag
Pages124-155
Number of pages32
Volume9885 LNCS
ISBN (Print)9783319494920
DOIs
Publication statusPublished - 2017
Event12th International Summer School on Reasoning Web Summer School, RW 2016 - Aberdeen, United Kingdom
Duration: 5 Sep 20169 Sep 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9885 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Conference

Conference12th International Summer School on Reasoning Web Summer School, RW 2016
CountryUnited Kingdom
City Aberdeen
Period5/09/169/09/16

Fingerprint

Linked Data
Knowledge representation
Semantic Web
Metadata
Scalability
Cleaning
Experiments
Availability
Knowledge Representation and Reasoning
Accessibility
Knowledge Base
Testbed
Indexing
Experiment
Infrastructure
Reasoning
Query
Resources

Cite this

Beek, W., Rietveld, L., Ilievski, F., & Schlobach, S. (2017). LOD lab: Scalable linked data processing. In Reasoning Web: Logical Foundation of Knowledge Graph Construction and Query Answering - 12th International Summer School 2016, Tutorial Lectures (Vol. 9885 LNCS, pp. 124-155). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9885 LNCS). Springer/Verlag. https://doi.org/10.1007/978-3-319-49493-7_4
Beek, Wouter ; Rietveld, Laurens ; Ilievski, F. ; Schlobach, Stefan. / LOD lab : Scalable linked data processing. Reasoning Web: Logical Foundation of Knowledge Graph Construction and Query Answering - 12th International Summer School 2016, Tutorial Lectures. Vol. 9885 LNCS Springer/Verlag, 2017. pp. 124-155 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{97e7492b3f604d8597c2ea8af941a33e,
title = "LOD lab: Scalable linked data processing",
abstract = "With tens if not hundreds of billions of logical statements, the Linked Open Data (LOD) is one of the biggest knowledge bases ever built. As such it is a gigantic source of information for applications in various domains, but also given its size an ideal test-bed for knowledge representation and reasoning, heterogeneous nature, and complexity. However, making use of this unique resource has proven next to impossible in the past due to a number of problems, including data collection, quality, accessibility, scalability, availability and findability. The LOD Laundromat and LOD Lab are recent infrastructures that addresses these problems in a systematic way, by automatically crawling, cleaning, indexing, analysing and republishing data in a unified way. Given a family of simple tools, LOD Lab allows researchers to query, access, analyse and manipulate hundreds of thousands of data documents seamlessly, e.g. facilitating experiments (e.g. for reasoning) over hundreds of thousands of (possibly integrated) datasets based on content and meta-data. This chapter provides the theoretical basis and practical skills required for making ideal use of this large scale experimental platform. First we study the problems that make it so hard to work with Semantic Web data in its current form. We’ll also propose generic solutions and introduce the tools the reader needs to get started with their own experiments on the LOD Cloud.",
author = "Wouter Beek and Laurens Rietveld and F. Ilievski and Stefan Schlobach",
year = "2017",
doi = "10.1007/978-3-319-49493-7_4",
language = "English",
isbn = "9783319494920",
volume = "9885 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer/Verlag",
pages = "124--155",
booktitle = "Reasoning Web: Logical Foundation of Knowledge Graph Construction and Query Answering - 12th International Summer School 2016, Tutorial Lectures",

}

Beek, W, Rietveld, L, Ilievski, F & Schlobach, S 2017, LOD lab: Scalable linked data processing. in Reasoning Web: Logical Foundation of Knowledge Graph Construction and Query Answering - 12th International Summer School 2016, Tutorial Lectures. vol. 9885 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9885 LNCS, Springer/Verlag, pp. 124-155, 12th International Summer School on Reasoning Web Summer School, RW 2016, Aberdeen, United Kingdom, 5/09/16. https://doi.org/10.1007/978-3-319-49493-7_4

LOD lab : Scalable linked data processing. / Beek, Wouter; Rietveld, Laurens; Ilievski, F.; Schlobach, Stefan.

Reasoning Web: Logical Foundation of Knowledge Graph Construction and Query Answering - 12th International Summer School 2016, Tutorial Lectures. Vol. 9885 LNCS Springer/Verlag, 2017. p. 124-155 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9885 LNCS).

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - LOD lab

T2 - Scalable linked data processing

AU - Beek, Wouter

AU - Rietveld, Laurens

AU - Ilievski, F.

AU - Schlobach, Stefan

PY - 2017

Y1 - 2017

N2 - With tens if not hundreds of billions of logical statements, the Linked Open Data (LOD) is one of the biggest knowledge bases ever built. As such it is a gigantic source of information for applications in various domains, but also given its size an ideal test-bed for knowledge representation and reasoning, heterogeneous nature, and complexity. However, making use of this unique resource has proven next to impossible in the past due to a number of problems, including data collection, quality, accessibility, scalability, availability and findability. The LOD Laundromat and LOD Lab are recent infrastructures that addresses these problems in a systematic way, by automatically crawling, cleaning, indexing, analysing and republishing data in a unified way. Given a family of simple tools, LOD Lab allows researchers to query, access, analyse and manipulate hundreds of thousands of data documents seamlessly, e.g. facilitating experiments (e.g. for reasoning) over hundreds of thousands of (possibly integrated) datasets based on content and meta-data. This chapter provides the theoretical basis and practical skills required for making ideal use of this large scale experimental platform. First we study the problems that make it so hard to work with Semantic Web data in its current form. We’ll also propose generic solutions and introduce the tools the reader needs to get started with their own experiments on the LOD Cloud.

AB - With tens if not hundreds of billions of logical statements, the Linked Open Data (LOD) is one of the biggest knowledge bases ever built. As such it is a gigantic source of information for applications in various domains, but also given its size an ideal test-bed for knowledge representation and reasoning, heterogeneous nature, and complexity. However, making use of this unique resource has proven next to impossible in the past due to a number of problems, including data collection, quality, accessibility, scalability, availability and findability. The LOD Laundromat and LOD Lab are recent infrastructures that addresses these problems in a systematic way, by automatically crawling, cleaning, indexing, analysing and republishing data in a unified way. Given a family of simple tools, LOD Lab allows researchers to query, access, analyse and manipulate hundreds of thousands of data documents seamlessly, e.g. facilitating experiments (e.g. for reasoning) over hundreds of thousands of (possibly integrated) datasets based on content and meta-data. This chapter provides the theoretical basis and practical skills required for making ideal use of this large scale experimental platform. First we study the problems that make it so hard to work with Semantic Web data in its current form. We’ll also propose generic solutions and introduce the tools the reader needs to get started with their own experiments on the LOD Cloud.

UR - http://www.scopus.com/inward/record.url?scp=85014879851&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85014879851&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-49493-7_4

DO - 10.1007/978-3-319-49493-7_4

M3 - Conference contribution

SN - 9783319494920

VL - 9885 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 124

EP - 155

BT - Reasoning Web: Logical Foundation of Knowledge Graph Construction and Query Answering - 12th International Summer School 2016, Tutorial Lectures

PB - Springer/Verlag

ER -

Beek W, Rietveld L, Ilievski F, Schlobach S. LOD lab: Scalable linked data processing. In Reasoning Web: Logical Foundation of Knowledge Graph Construction and Query Answering - 12th International Summer School 2016, Tutorial Lectures. Vol. 9885 LNCS. Springer/Verlag. 2017. p. 124-155. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-49493-7_4