Abstract
This paper introduces the Open Cantonese Sense-Tagged Corpus, a new and ongoing project that serves as a companion to the development of the Cantonese Wordnet. The corpus is built on top of the Cantonese Wordnet Corpus, which currently provides example sentences for most verbs in this wordnet. This paper motivates the choice of starting a sense-tagged corpus from both linguistic and educational perspectives, and discusses the current solutions to issues arising from the sense-tagging exercise. In total, we have tagged over 5,000 concepts, with more than 3,700 direct links to the Cantonese Wordnet.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 12th Global Wordnet Conference |
| Editors | German Rigau, Francis Bond, Alexandre Rademaker |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 263-268 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781713890881 |
| Publication status | Published - 2023 |
| Event | 12th Global Wordnet Conference, GWC 2023 - Donostia-San Sebastian, Spain. Duration: 23 Jan 2023 → 27 Jan 2023 |
Conference
| Conference | 12th Global Wordnet Conference, GWC 2023 |
|---|---|
| Country/Territory | Spain |
| City | Donostia-San Sebastian |
| Period | 23/01/23 → 27/01/23 |
Bibliographical note
Publisher Copyright: © 2023 12th Global Wordnet Conference, GWC 2023. All rights reserved.