Abstract
In this paper, we present a system for detecting complex named entities in multilingual and code-mix settings. We discuss the results obtained in task 11 (MultiCoNER) of the SemEval 2022 competition. The model is an ensemble of various transformer-based language models combined with a Conditional Random Field (CRF) layer. Our model ranks fourth in track 12 (multilingual track) and fifth in track 13 (code-mixed track). We describe the details of our model implementation and discuss the effect of different aggregation methods. Finally, we conduct additional analyses to understand the performance differences between languages.
Original language | English |
---|---|
Title of host publication | Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022) |
Editors | Guy Emerson, Natalie Schluter, Gabriel Stanovsky, Ritesh Kumar, Alexis Palmer, Nathan Schneider, Siddharth Singh, Shyam Ratan |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 1583-1592 |
Number of pages | 10 |
ISBN (Electronic) | 9781955917803 |
DOIs | |
Publication status | Published - Jul 2022 |
Event | 16th International Workshop on Semantic Evaluation, SemEval 2022 - Seattle, United States Duration: 14 Jul 2022 → 15 Jul 2022 |
Conference
Conference | 16th International Workshop on Semantic Evaluation, SemEval 2022 |
---|---|
Country/Territory | United States |
City | Seattle |
Period | 14/07/22 → 15/07/22 |
Bibliographical note
Funding Information:The research by the authors of affiliation 1 and by Wondimagegnhue Tufa was funded by Huawei Finland.
Publisher Copyright:
© 2022 Association for Computational Linguistics.
Funding
The research by the authors of affiliation 1 and by Wondimagegnhue Tufa was funded by Huawei Finland.