An Exploration of Zero-Shot Natural Language Inference-Based Hate Speech Detection

Nerses Yuzbashyan, Nikolay Banar, Ilia Markov, Walter Daelemans

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Conventional techniques for detecting online hate speech rely on the availability of a sufficient number of annotated instances, which can be costly and time consuming. For this reason, zero-shot or few-shot detection can offer an attractive alternative. In this paper, we explore a zero-shot detection approach based on natural language inference (NLI) models. The performance of the models in this approach depends heavily on the choice of a hypothesis, which represents a statement that is evaluated with a given sentence to determine the logical relationship between them. Our goal is to determine which factors affect the quality of detection. We conducted a set of experiments with three NLI models and four hate speech datasets. We demonstrate that a zero-shot NLI-based approach is competitive with approaches that require supervised learning, yet they are highly sensitive to the choice of hypothesis. In addition, our experiments indicate that the results for a set of hypotheses on different model-data pairs are positively correlated, and that the correlation is higher for different datasets when using the same model than it is for different models when using the same dataset. These results suggest that if we find a hypothesis that works well for a specific model and domain or for a specific type of hate speech, we can use that hypothesis with the same model also within a different domain. While another model might require different suitable hypotheses in order to demonstrate high performance.

Original languageEnglish
Title of host publicationProceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion
EditorsBharathi Raja Chakravarthi, B. Bharathi, Joephine Griffith, Kalika Bali, Paul Buitelaar
PublisherIncoma Ltd.
Pages1-9
Number of pages9
ISBN (Electronic)9789544520847
DOIs
Publication statusPublished - 2023
Event3rd Workshop on Language Technology for Equality, Diversity and Inclusion, LTEDI 2023 - Varna, Bulgaria
Duration: 7 Sept 2023 → …

Conference

Conference3rd Workshop on Language Technology for Equality, Diversity and Inclusion, LTEDI 2023
Country/TerritoryBulgaria
CityVarna
Period7/09/23 → …

Bibliographical note

Publisher Copyright:
© 2023 LTEDI 2023 - 3rd Workshop on Language Technology for Equality, Diversity and Inclusion, associated with the 14th International Conference on Recent Advances in Natural Language Processing, RANLP 2023 - Proceedings. All rights reserved.

Fingerprint

Dive into the research topics of 'An Exploration of Zero-Shot Natural Language Inference-Based Hate Speech Detection'. Together they form a unique fingerprint.

Cite this