Matching multi-lingual subject vocabularies

S. Wang, A.H.J.C.A. Isaac, B.A.C. Schopman, K.S. Schlobach, L. van der Meij

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Most libraries and other cultural heritage institutions use controlled knowledge organisation systems, such as thesauri, to describe their collections. Unfortunately, as most of these institutions use different such systems, unified access to heterogeneous collections is difficult. Things are even worse in an international context when concepts have labels in different languages. In order to overcome the multilingual interoperability problem between European Libraries, extensive work has been done to manually map concepts from different knowledge organisation systems, which is a tedious and expensive process.

Within the TELplus project, we developed and evaluated methods to automatically discover these mappings, using different ontology matching techniques. In experiments on major French, English and German subject heading lists Rameau, LCSH and SWD, we show that we can automatically produce mappings of surprisingly good quality, even when using relatively naive translation and matching methods.
Original languageEnglish
Title of host publicationResearch and Advanced Technology for Digital Libraries
Subtitle of host publication13th European Conference, ECDL 2009Corfu, Greece, September 27 – October 2, 2009, Proceedings
EditorsMaristella Agosti
PublisherSpringer Verlag
Pages125-137
Number of pages13
ISBN (Print)9783642043451, 3642043453
DOIs
Publication statusPublished - 2009
Event13th European Conference on Digital Libraries (ECDL 2009) -
Duration: 1 Jan 20091 Jan 2009

Publication series

NameLecture Notes in Computer Science
Number5714
ISSN (Print)0302-9743

Conference

Conference13th European Conference on Digital Libraries (ECDL 2009)
Period1/01/091/01/09

Fingerprint

Thesauri
Interoperability
Ontology
Labels
Experiments

Cite this

Wang, S., Isaac, A. H. J. C. A., Schopman, B. A. C., Schlobach, K. S., & van der Meij, L. (2009). Matching multi-lingual subject vocabularies. In M. Agosti (Ed.), Research and Advanced Technology for Digital Libraries: 13th European Conference, ECDL 2009Corfu, Greece, September 27 – October 2, 2009, Proceedings (pp. 125-137). (Lecture Notes in Computer Science; No. 5714). Springer Verlag. https://doi.org/10.1007/978-3-642-04346-8_14
Wang, S. ; Isaac, A.H.J.C.A. ; Schopman, B.A.C. ; Schlobach, K.S. ; van der Meij, L. / Matching multi-lingual subject vocabularies. Research and Advanced Technology for Digital Libraries: 13th European Conference, ECDL 2009Corfu, Greece, September 27 – October 2, 2009, Proceedings. editor / Maristella Agosti. Springer Verlag, 2009. pp. 125-137 (Lecture Notes in Computer Science; 5714).
@inproceedings{b5da14791cd942ebaf6991c4e7043a48,
title = "Matching multi-lingual subject vocabularies",
abstract = "Most libraries and other cultural heritage institutions use controlled knowledge organisation systems, such as thesauri, to describe their collections. Unfortunately, as most of these institutions use different such systems, unified access to heterogeneous collections is difficult. Things are even worse in an international context when concepts have labels in different languages. In order to overcome the multilingual interoperability problem between European Libraries, extensive work has been done to manually map concepts from different knowledge organisation systems, which is a tedious and expensive process.Within the TELplus project, we developed and evaluated methods to automatically discover these mappings, using different ontology matching techniques. In experiments on major French, English and German subject heading lists Rameau, LCSH and SWD, we show that we can automatically produce mappings of surprisingly good quality, even when using relatively naive translation and matching methods.",
author = "S. Wang and A.H.J.C.A. Isaac and B.A.C. Schopman and K.S. Schlobach and {van der Meij}, L.",
year = "2009",
doi = "10.1007/978-3-642-04346-8_14",
language = "English",
isbn = "9783642043451",
series = "Lecture Notes in Computer Science",
publisher = "Springer Verlag",
number = "5714",
pages = "125--137",
editor = "Maristella Agosti",
booktitle = "Research and Advanced Technology for Digital Libraries",

}

Wang, S, Isaac, AHJCA, Schopman, BAC, Schlobach, KS & van der Meij, L 2009, Matching multi-lingual subject vocabularies. in M Agosti (ed.), Research and Advanced Technology for Digital Libraries: 13th European Conference, ECDL 2009Corfu, Greece, September 27 – October 2, 2009, Proceedings. Lecture Notes in Computer Science, no. 5714, Springer Verlag, pp. 125-137, 13th European Conference on Digital Libraries (ECDL 2009), 1/01/09. https://doi.org/10.1007/978-3-642-04346-8_14

Matching multi-lingual subject vocabularies. / Wang, S.; Isaac, A.H.J.C.A.; Schopman, B.A.C.; Schlobach, K.S.; van der Meij, L.

Research and Advanced Technology for Digital Libraries: 13th European Conference, ECDL 2009Corfu, Greece, September 27 – October 2, 2009, Proceedings. ed. / Maristella Agosti. Springer Verlag, 2009. p. 125-137 (Lecture Notes in Computer Science; No. 5714).

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - Matching multi-lingual subject vocabularies

AU - Wang, S.

AU - Isaac, A.H.J.C.A.

AU - Schopman, B.A.C.

AU - Schlobach, K.S.

AU - van der Meij, L.

PY - 2009

Y1 - 2009

N2 - Most libraries and other cultural heritage institutions use controlled knowledge organisation systems, such as thesauri, to describe their collections. Unfortunately, as most of these institutions use different such systems, unified access to heterogeneous collections is difficult. Things are even worse in an international context when concepts have labels in different languages. In order to overcome the multilingual interoperability problem between European Libraries, extensive work has been done to manually map concepts from different knowledge organisation systems, which is a tedious and expensive process.Within the TELplus project, we developed and evaluated methods to automatically discover these mappings, using different ontology matching techniques. In experiments on major French, English and German subject heading lists Rameau, LCSH and SWD, we show that we can automatically produce mappings of surprisingly good quality, even when using relatively naive translation and matching methods.

AB - Most libraries and other cultural heritage institutions use controlled knowledge organisation systems, such as thesauri, to describe their collections. Unfortunately, as most of these institutions use different such systems, unified access to heterogeneous collections is difficult. Things are even worse in an international context when concepts have labels in different languages. In order to overcome the multilingual interoperability problem between European Libraries, extensive work has been done to manually map concepts from different knowledge organisation systems, which is a tedious and expensive process.Within the TELplus project, we developed and evaluated methods to automatically discover these mappings, using different ontology matching techniques. In experiments on major French, English and German subject heading lists Rameau, LCSH and SWD, we show that we can automatically produce mappings of surprisingly good quality, even when using relatively naive translation and matching methods.

U2 - 10.1007/978-3-642-04346-8_14

DO - 10.1007/978-3-642-04346-8_14

M3 - Conference contribution

SN - 9783642043451

SN - 3642043453

T3 - Lecture Notes in Computer Science

SP - 125

EP - 137

BT - Research and Advanced Technology for Digital Libraries

A2 - Agosti, Maristella

PB - Springer Verlag

ER -

Wang S, Isaac AHJCA, Schopman BAC, Schlobach KS, van der Meij L. Matching multi-lingual subject vocabularies. In Agosti M, editor, Research and Advanced Technology for Digital Libraries: 13th European Conference, ECDL 2009Corfu, Greece, September 27 – October 2, 2009, Proceedings. Springer Verlag. 2009. p. 125-137. (Lecture Notes in Computer Science; 5714). https://doi.org/10.1007/978-3-642-04346-8_14