Meta-data for a lot of LOD

Laurens Rietveld, Wouter Beek, Rinke Hoekstra, Stefan Schlobach

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

This paper introduces the LOD Laundromat meta-dataset, a continuously updated RDF meta-dataset that describes the documents crawled, cleaned and (re)published by the LOD Laundromat. This meta-dataset of over 110 million triples contains structural information for more than 650,000 documents (and growing). Dataset meta-data is often not provided alongside published data, it is incomplete or it is incomparable given the way they were generated. The LOD Laundromat meta-dataset provides a wide range of structural dataset properties, such as the number of triples in LOD Laundromat documents, the average degree in documents, and the distinct number of Blank Nodes, Literals and IRIs. This makes it a particularly useful dataset for data comparison and analytics, as well as for the global study of the Web of Data. This paper presents the dataset, its requirements, and its impact.
LanguageEnglish
Pages1067-1080
Number of pages14
JournalSemantic Web
Volume8
Issue number6
DOIs
Publication statusPublished - 1 Jan 2017

Fingerprint

Metadata
Structural properties

Cite this

Rietveld, Laurens ; Beek, Wouter ; Hoekstra, Rinke ; Schlobach, Stefan. / Meta-data for a lot of LOD. In: Semantic Web. 2017 ; Vol. 8, No. 6. pp. 1067-1080.
@article{f79133cc037d4f42a6bd4bd84d446488,
title = "Meta-data for a lot of LOD",
abstract = "This paper introduces the LOD Laundromat meta-dataset, a continuously updated RDF meta-dataset that describes the documents crawled, cleaned and (re)published by the LOD Laundromat. This meta-dataset of over 110 million triples contains structural information for more than 650,000 documents (and growing). Dataset meta-data is often not provided alongside published data, it is incomplete or it is incomparable given the way they were generated. The LOD Laundromat meta-dataset provides a wide range of structural dataset properties, such as the number of triples in LOD Laundromat documents, the average degree in documents, and the distinct number of Blank Nodes, Literals and IRIs. This makes it a particularly useful dataset for data comparison and analytics, as well as for the global study of the Web of Data. This paper presents the dataset, its requirements, and its impact.",
author = "Laurens Rietveld and Wouter Beek and Rinke Hoekstra and Stefan Schlobach",
year = "2017",
month = "1",
day = "1",
doi = "10.3233/SW-170256",
language = "English",
volume = "8",
pages = "1067--1080",
journal = "Semantic Web",
issn = "1570-0844",
publisher = "IOS Press",
number = "6",

}

Rietveld, L, Beek, W, Hoekstra, R & Schlobach, S 2017, 'Meta-data for a lot of LOD', Semantic Web, vol. 8, no. 6, pp. 1067-1080. https://doi.org/10.3233/SW-170256

Meta-data for a lot of LOD. / Rietveld, Laurens; Beek, Wouter; Hoekstra, Rinke; Schlobach, Stefan.

In: Semantic Web, Vol. 8, No. 6, 01.01.2017, p. 1067-1080.

Research output: Contribution to JournalArticleAcademicpeer-review

TY - JOUR

T1 - Meta-data for a lot of LOD

AU - Rietveld, Laurens

AU - Beek, Wouter

AU - Hoekstra, Rinke

AU - Schlobach, Stefan

PY - 2017/1/1

Y1 - 2017/1/1

N2 - This paper introduces the LOD Laundromat meta-dataset, a continuously updated RDF meta-dataset that describes the documents crawled, cleaned and (re)published by the LOD Laundromat. This meta-dataset of over 110 million triples contains structural information for more than 650,000 documents (and growing). Dataset meta-data is often not provided alongside published data, it is incomplete or it is incomparable given the way they were generated. The LOD Laundromat meta-dataset provides a wide range of structural dataset properties, such as the number of triples in LOD Laundromat documents, the average degree in documents, and the distinct number of Blank Nodes, Literals and IRIs. This makes it a particularly useful dataset for data comparison and analytics, as well as for the global study of the Web of Data. This paper presents the dataset, its requirements, and its impact.

AB - This paper introduces the LOD Laundromat meta-dataset, a continuously updated RDF meta-dataset that describes the documents crawled, cleaned and (re)published by the LOD Laundromat. This meta-dataset of over 110 million triples contains structural information for more than 650,000 documents (and growing). Dataset meta-data is often not provided alongside published data, it is incomplete or it is incomparable given the way they were generated. The LOD Laundromat meta-dataset provides a wide range of structural dataset properties, such as the number of triples in LOD Laundromat documents, the average degree in documents, and the distinct number of Blank Nodes, Literals and IRIs. This makes it a particularly useful dataset for data comparison and analytics, as well as for the global study of the Web of Data. This paper presents the dataset, its requirements, and its impact.

UR - http://www.scopus.com/inward/record.url?scp=85027402307&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85027402307&partnerID=8YFLogxK

U2 - 10.3233/SW-170256

DO - 10.3233/SW-170256

M3 - Article

VL - 8

SP - 1067

EP - 1080

JO - Semantic Web

T2 - Semantic Web

JF - Semantic Web

SN - 1570-0844

IS - 6

ER -