Projects per year
Abstract
With the continuous growth of benchmark corpora, which often annotate the same documents, there is a range of opportunities to compare and combine similar and complementary annotations. However, these opportunities are hampered by a wide range of problems that are related to the lack of resource interoperability. In this paper, we illustrate these problems by assessing aspects of interoperability at the document-level across a set of 20 corpora annotated with (aspects of) events. The issues range from applying different document naming conventions, to mismatches in textual content and structural/conceptual differences among annotation schemes. We provide insight into the exact document intersections between the corpora by mapping their document identifiers and perform an empirical analysis of event annotations showing their compatibility and consistency in and across the corpora. This way, we aim to make the community more aware of the challenges and opportunities and to inspire working collaboratively towards interoperable resources.
Original language | English |
---|---|
Title of host publication | Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018) |
Editors | Hitoshi Isahara, Bente Maegaard, Stelios Piperidis, Christopher Cieri, Thierry Declerck, Koiti Hasida, Helene Mazo, Khalid Choukri, Sara Goggi, Joseph Mariani, Asuncion Moreno, Nicoletta Calzolari, Jan Odijk, Takenobu Tokunaga |
Place of Publication | Miyazaki |
Publisher | European Language Resources Association (ELRA) |
Pages | 1101-1111 |
Number of pages | 11 |
ISBN (Electronic) | 9791095546009 |
Publication status | Published - 2018 |
Event | 11th International Conference on Language Resources and Evaluation, LREC 2018 - Miyazaki, Japan Duration: 7 May 2018 → 12 May 2018 |
Conference
Conference | 11th International Conference on Language Resources and Evaluation, LREC 2018 |
---|---|
Country/Territory | Japan |
City | Miyazaki |
Period | 7/05/18 → 12/05/18 |
Keywords
- Annotation consistency
- Events
- Resource interoperability
Fingerprint
Dive into the research topics of 'Resource Interoperability for Sustainable Benchmarking: The Case of Events: The case of events'. Together they form a unique fingerprint.-
Understanding of Language by Machines
Segers, R. H., Vossen, P., Baez Santamaria, S. & Cybulska, A.
1/09/13 → 1/01/25
Project: Research
-
Storylines and perspectives — Understanding of Language by Machines
van Son, C. M., Morante Vallejo, R., Caselli, T. & Vossen, P.
1/05/14 → 1/01/20
Project: Research