Abstract
In the absence of a central naming authority on the Semantic Web, it is common for different datasets to refer to the same thing by different IRIs. Whenever multiple names are used to denote the same thing, owl:sameAs statements are needed in order to link the data and foster reuse. Studies that date back as far as 2009, have observed that the owl:sameAs property is sometimes used incorrectly. In this paper, we show how network metrics such as the community structure of the owl:sameAs graph can be used in order to detect such possibly erroneous statements. One benefit of the here presented approach is that it can be applied to the network of owl:sameAs links itself, and does not rely on any additional knowledge. In order to illustrate its ability to scale, the approach is evaluated on the largest collection of identity links to date, containing over 558M owl:sameAs links scraped from the LOD Cloud.
Original language | English |
---|---|
Title of host publication | The Semantic Web – ISWC 2018 - 17th International Semantic Web Conference, 2018, Proceedings |
Editors | Mari Carmen Suárez-Figueroa, Valentina Presutti, Lucie-Aimee Kaffee, Elena Simperl, Marta Sabou, Denny Vrandecic, Irene Celino, Kalina Bontcheva |
Publisher | Springer/Verlag |
Pages | 391-407 |
Number of pages | 17 |
ISBN (Print) | 9783030006709 |
DOIs | |
Publication status | Published - 2018 |
Event | 17th International Semantic Web Conference, ISWC 2018 - Monterey, United States Duration: 8 Oct 2018 → 12 Oct 2018 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 11136 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 17th International Semantic Web Conference, ISWC 2018 |
---|---|
Country | United States |
City | Monterey |
Period | 8/10/18 → 12/10/18 |
Fingerprint
Keywords
- Communities
- Identity
- Linked Open Data
- Owl:sameAs
Cite this
}
Detecting erroneous identity links on the web using network metrics. / Raad, Joe; Beek, Wouter; van Harmelen, Frank; Pernelle, Nathalie; Saïs, Fatiha.
The Semantic Web – ISWC 2018 - 17th International Semantic Web Conference, 2018, Proceedings. ed. / Mari Carmen Suárez-Figueroa; Valentina Presutti; Lucie-Aimee Kaffee; Elena Simperl; Marta Sabou; Denny Vrandecic; Irene Celino; Kalina Bontcheva. Springer/Verlag, 2018. p. 391-407 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11136 LNCS).Research output: Chapter in Book / Report / Conference proceeding › Conference contribution › Academic › peer-review
TY - GEN
T1 - Detecting erroneous identity links on the web using network metrics
AU - Raad, Joe
AU - Beek, Wouter
AU - van Harmelen, Frank
AU - Pernelle, Nathalie
AU - Saïs, Fatiha
PY - 2018
Y1 - 2018
N2 - In the absence of a central naming authority on the Semantic Web, it is common for different datasets to refer to the same thing by different IRIs. Whenever multiple names are used to denote the same thing, owl:sameAs statements are needed in order to link the data and foster reuse. Studies that date back as far as 2009, have observed that the owl:sameAs property is sometimes used incorrectly. In this paper, we show how network metrics such as the community structure of the owl:sameAs graph can be used in order to detect such possibly erroneous statements. One benefit of the here presented approach is that it can be applied to the network of owl:sameAs links itself, and does not rely on any additional knowledge. In order to illustrate its ability to scale, the approach is evaluated on the largest collection of identity links to date, containing over 558M owl:sameAs links scraped from the LOD Cloud.
AB - In the absence of a central naming authority on the Semantic Web, it is common for different datasets to refer to the same thing by different IRIs. Whenever multiple names are used to denote the same thing, owl:sameAs statements are needed in order to link the data and foster reuse. Studies that date back as far as 2009, have observed that the owl:sameAs property is sometimes used incorrectly. In this paper, we show how network metrics such as the community structure of the owl:sameAs graph can be used in order to detect such possibly erroneous statements. One benefit of the here presented approach is that it can be applied to the network of owl:sameAs links itself, and does not rely on any additional knowledge. In order to illustrate its ability to scale, the approach is evaluated on the largest collection of identity links to date, containing over 558M owl:sameAs links scraped from the LOD Cloud.
KW - Communities
KW - Identity
KW - Linked Open Data
KW - Owl:sameAs
UR - http://www.scopus.com/inward/record.url?scp=85054815258&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054815258&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-00671-6_23
DO - 10.1007/978-3-030-00671-6_23
M3 - Conference contribution
SN - 9783030006709
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 391
EP - 407
BT - The Semantic Web – ISWC 2018 - 17th International Semantic Web Conference, 2018, Proceedings
A2 - Suárez-Figueroa, Mari Carmen
A2 - Presutti, Valentina
A2 - Kaffee, Lucie-Aimee
A2 - Simperl, Elena
A2 - Sabou, Marta
A2 - Vrandecic, Denny
A2 - Celino, Irene
A2 - Bontcheva, Kalina
PB - Springer/Verlag
ER -