Abstract
Conventional techniques for detecting online hate speech rely on the availability of a sufficient number of annotated instances, which can be costly and time consuming. For this reason, zero-shot or few-shot detection can offer an attractive alternative. In this paper, we explore a zero-shot detection approach based on natural language inference (NLI) models. The performance of the models in this approach depends heavily on the choice of a hypothesis, which represents a statement that is evaluated with a given sentence to determine the logical relationship between them. Our goal is to determine which factors affect the quality of detection. We conducted a set of experiments with three NLI models and four hate speech datasets. We demonstrate that a zero-shot NLI-based approach is competitive with approaches that require supervised learning, yet they are highly sensitive to the choice of hypothesis. In addition, our experiments indicate that the results for a set of hypotheses on different model-data pairs are positively correlated, and that the correlation is higher for different datasets when using the same model than it is for different models when using the same dataset. These results suggest that if we find a hypothesis that works well for a specific model and domain or for a specific type of hate speech, we can use that hypothesis with the same model also within a different domain. While another model might require different suitable hypotheses in order to demonstrate high performance.
Original language | English |
---|---|
Title of host publication | Proceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion |
Editors | Bharathi Raja Chakravarthi, B. Bharathi, Joephine Griffith, Kalika Bali, Paul Buitelaar |
Publisher | Incoma Ltd. |
Pages | 1-9 |
Number of pages | 9 |
ISBN (Electronic) | 9789544520847 |
DOIs | |
Publication status | Published - 2023 |
Event | 3rd Workshop on Language Technology for Equality, Diversity and Inclusion, LTEDI 2023 - Varna, Bulgaria Duration: 7 Sept 2023 → … |
Conference
Conference | 3rd Workshop on Language Technology for Equality, Diversity and Inclusion, LTEDI 2023 |
---|---|
Country/Territory | Bulgaria |
City | Varna |
Period | 7/09/23 → … |
Bibliographical note
Publisher Copyright:© 2023 LTEDI 2023 - 3rd Workshop on Language Technology for Equality, Diversity and Inclusion, associated with the 14th International Conference on Recent Advances in Natural Language Processing, RANLP 2023 - Proceedings. All rights reserved.