SchemaTree: Maximum-Likelihood Property Recommendation for Wikidata

Lars C. Gleim*, Rafael Schimassek, Dominik Hüser, Maximilian Peters, Christoph Krämer, Michael Cochez, Stefan Decker

*Corresponding author for this work

Research output: Chapter in Book / Report / Conference proceeding; Conference contribution; Academic; peer-review

Abstract

Wikidata is a free and open knowledge base that can be read and edited by both humans and machines. It acts as central storage for the structured data of several Wikimedia projects. To improve the process of manually inserting new facts, the Wikidata platform features an association rule-based tool that recommends additional suitable properties. In this work, we introduce a novel approach to providing such recommendations based on frequentist inference. We introduce a trie-based method that can efficiently learn and represent property set probabilities in RDF graphs. We extend the method by adding type information to improve recommendation precision, and we introduce backoff strategies that further increase the performance of the initial approach for entities with rare property combinations. We investigate how the captured structure can be employed for property recommendation, analogously to the Wikidata PropertySuggester. We evaluate our approach on the full Wikidata dataset and compare its performance to the state-of-the-art Wikidata PropertySuggester, outperforming it in all evaluated metrics. Notably, we reduce the average rank of the first relevant recommendation by 71%.
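To make the frequentist idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It estimates the probability of a candidate property given the properties already present on an entity as a ratio of counts, and ranks candidates by that estimate. It deliberately scans a plain list of property sets instead of building the trie described in the paper, and it omits type information and backoff. The function name `recommend`, the `top_k` parameter, and the toy entities are assumptions made for illustration only.

```python
from collections import Counter


def recommend(entities, observed, top_k=3):
    """Rank candidate properties by the estimate
    count(observed + candidate) / count(observed).

    `entities` is an iterable of frozensets of property IDs (toy data here).
    A linear scan stands in for the trie used in the paper.
    """
    observed = frozenset(observed)
    # Keep only entities whose property set contains every observed property.
    matching = [props for props in entities if observed <= props]
    if not matching:
        # No supporting entity: this is where a backoff strategy would apply.
        return []
    # Count how often each not-yet-present property co-occurs with `observed`.
    counts = Counter(p for props in matching for p in props - observed)
    total = len(matching)
    # Sort by descending count, then by property ID for a stable order.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(prop, count / total) for prop, count in ranked[:top_k]]


# Hypothetical property sets of five entities (illustrative, not real data).
entities = [
    frozenset({"P31", "P21", "P569"}),
    frozenset({"P31", "P21", "P569", "P19"}),
    frozenset({"P31", "P21"}),
    frozenset({"P31", "P17"}),
    frozenset({"P31", "P17", "P625"}),
]

for prop, prob in recommend(entities, {"P31", "P21"}):
    print(f"{prop}: {prob:.2f}")
```

With this toy data, three entities contain both observed properties, so each candidate's score is its co-occurrence count divided by three. A linear scan like this costs time proportional to the number of entities per query, which is the cost a trie over sorted property sets is meant to avoid.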

Original language: English
Title of host publication: The Semantic Web
Subtitle of host publication: 17th International Conference, ESWC 2020, Heraklion, Crete, Greece, May 31–June 4, 2020, Proceedings
Editors: Andreas Harth, Sabrina Kirrane, Axel-Cyrille Ngonga Ngomo, Heiko Paulheim, Anisa Rula, Anna Lisa Gentile, Peter Haase, Michael Cochez
Publisher: Springer
Pages: 179-195
Number of pages: 17
ISBN (Electronic): 9783030494612
ISBN (Print): 9783030494605
DOIs
Publication status: Published - 2020
Event: 17th Extended Semantic Web Conference, ESWC 2020 - Heraklion, Greece
Duration: 31 May 2020 – 4 Jun 2020

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 12123 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 17th Extended Semantic Web Conference, ESWC 2020
Country/Territory: Greece
City: Heraklion
Period: 31/05/20 – 4/06/20

Keywords

  • Frequent pattern mining
  • Knowledge graph editing
  • Recommender systems
  • Statistical property recommendation
  • Wikidata
