Quantifying the bias in data links

Ilaria Tiddi*, Mathieu D’aquin, Enrico Motta

*Corresponding author for this work

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

The main idea behind Linked Data is to connect data from different sources together, in order to develop a hub of shared and publicly accessible knowledge. While the benefit of sharing knowledge is universally recognised, what is less visible is how much results can be affected when the knowledge in one dataset and in the connected ones are not equally distributed. This lack of balance in information, or bias, generally assumed a priori, can actually be quantified to improve the quality of the results of applications and analytics relying on such linked data. In this paper, we propose a process to measure how much bias one dataset contains when compared to another one, by identifying the most affected RDF properties and values within the set of entities that those datasets have in common (defined as the linkset). This process was ran on a wide range of linksets from Linked Data, and in the experiment section we present the results as well as measures of its performance.

Original languageEnglish
Title of host publicationKnowledge Engineering and Knowledge Management - 19th International Conference, EKAW 2014, Proceedings
EditorsStefan Schlobach, Krzysztof Janowicz, Eero Hyvönen, Patrick Lambrix
PublisherSpringer Verlag
Pages531-546
Number of pages16
ISBN (Electronic)9783319137032
Publication statusPublished - 1 Jan 2014
Externally publishedYes
Event19th International Conference on Knowledge Engineering and Knowledge Management, EKAW 2014 - Linköping, Sweden
Duration: 24 Nov 201428 Nov 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8876
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th International Conference on Knowledge Engineering and Knowledge Management, EKAW 2014
Country/TerritorySweden
CityLinköping
Period24/11/1428/11/14

Keywords

  • Datasets bias
  • Linked Data
  • Linksets Analysis

Fingerprint

Dive into the research topics of 'Quantifying the bias in data links'. Together they form a unique fingerprint.

Cite this