The Open Cantonese Sense-Tagged Corpus

Joanna Ut Seong Sio, Luis Morgado da Costa

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

This paper introduces the Open Cantonese Sense-Tagged Corpus, a new and ongoing project to serve as the companion to the development of the Cantonese Wordnet. This corpus is built on top of the Cantonese Wordnet Corpus, which currently provides example sentences for most verbs in this wordnet. This paper motivates the choice of starting a sense-tagged corpus from both linguistic and educational perspectives, and discusses the current solutions to issues arisen from the sense-tagging exercise. In total, we have tagged over 5,000 concepts, with more than 3,700 direct links to the Cantonese Wordnet.

Original languageEnglish
Title of host publicationProceedings of the 12th Global Wordnet Conference
EditorsGerman Rigau, Francis Bond, Alexandre Rademaker
PublisherAssociation for Computational Linguistics (ACL)
Pages263-268
Number of pages6
ISBN (Electronic)9781713890881
Publication statusPublished - 2023
Event12th Global Wordnet Conference, GWC 2023 - Donositia-San Sebastian, Spain
Duration: 23 Jan 202327 Jan 2023

Conference

Conference12th Global Wordnet Conference, GWC 2023
Country/TerritorySpain
CityDonositia-San Sebastian
Period23/01/2327/01/23

Bibliographical note

Publisher Copyright:
© 2023 12th Global Wordnet Conference, GWC 2023. All rights reserved.

Fingerprint

Dive into the research topics of 'The Open Cantonese Sense-Tagged Corpus'. Together they form a unique fingerprint.

Cite this