Contextual entity disambiguation in domains with weak identity criteria: Disambiguating golden age amsterdamers

Al Idrissou, Veruska Zamborlini, Frank Van Harmelen, Chiara Latronico

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Entity disambiguation is a widely investigated topic, and many matching algorithms have been proposed. However, this task has not yet been satisfactorily addressed when the domain of interest provides poor or incomplete data with little discriminating power. In these cases, the use of content fields such as name and date is not enough and the simple use of relations with other entities is not of much help when these related entities also need disambiguation before they can be used. Therefore, we propose an approach for the disambiguation of clustered resources using context (related entities that are also clustered) as evidence for reconciling matched entities. We test the proposed method on datasets of historical records from Amsterdam in the 17th century for which context is available, and we compare the results of the proposed approach to a gold standard generated by three experts, which we make available online. The results show that the proposed approach manages to meaningfully use context for isolating identity sub-clusters with higher quality by eliminating potentially false positive matches.

Original languageEnglish
Title of host publicationK-CAP 2019
Subtitle of host publicationProceedings of the 10th International Conference on Knowledge Capture
PublisherAssociation for Computing Machinery, Inc
Pages259-262
Number of pages4
ISBN (Electronic)9781450370080
DOIs
Publication statusPublished - 23 Sep 2019
Event10th International Conference on Knowledge Capture, K-CAP 2019 - Marina Del Rey, United States
Duration: 19 Nov 201921 Nov 2019

Conference

Conference10th International Conference on Knowledge Capture, K-CAP 2019
CountryUnited States
CityMarina Del Rey
Period19/11/1921/11/19

Keywords

  • Data integration
  • Entity disambiguation
  • Entity reconciliation
  • Entity resolution
  • Linked data

Cite this

Idrissou, A., Zamborlini, V., Van Harmelen, F., & Latronico, C. (2019). Contextual entity disambiguation in domains with weak identity criteria: Disambiguating golden age amsterdamers. In K-CAP 2019: Proceedings of the 10th International Conference on Knowledge Capture (pp. 259-262). Association for Computing Machinery, Inc. https://doi.org/10.1145/3360901.3364440