Content Moderation in Online Platforms: A Study of Annotation Methods for Inappropriate Language

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Detecting inappropriate language in online platforms is vital for maintaining a safe and respectful digital environment, especially in the context of hate speech prevention. However, defining what constitutes inappropriate language can be highly subjective and context-dependent, varying from person to person. This study presents the outcomes of a comprehensive examination of the subjectivity involved in assessing inappropriateness within conversational contexts. Different annotation methods, including expert annotation, crowd annotation, ChatGPT-generated annotation, and lexicon-based annotation, were applied to English Reddit conversations. The analysis revealed a high level of agreement across these annotation methods, with most disagreements arising from subjective interpretations of inappropriate language. This emphasizes the importance of implementing content moderation systems that not only recognize inappropriate content but also understand and adapt to diverse user perspectives and contexts. The study contributes to the evolving field of hate speech annotation by providing a detailed analysis of annotation differences in relation to the subjective task of judging inappropriate words in conversations.

Original languageEnglish
Title of host publicationProceedings of the Fourth Workshop on Threat, Aggression & Cyberbullying @ LREC-COLING-2024
EditorsRitesh Kumar, Atul Kr. Ojha, Atul Kr. Ojha, Shervin Malmasi, Bharathi Raja Chakravarthi, Bornini Lahiri, Siddharth Singh, Shyam Ratan
PublisherELRA and ICCL
Pages96-104
Number of pages9
ISBN (Electronic)9782493814470
Publication statusPublished - 2024
Event4th Workshop on Threat, Aggression and Cyberbullying, TRAC 2024 - Torino, Italy
Duration: 20 May 2024 → …

Conference

Conference4th Workshop on Threat, Aggression and Cyberbullying, TRAC 2024
Country/TerritoryItaly
CityTorino
Period20/05/24 → …

Bibliographical note

Publisher Copyright:
© 2024 ELRA Language Resource Association.

Keywords

  • Inappropriate Language
  • Online Content Moderation
  • Subjectivity in Annotation

Fingerprint

Dive into the research topics of 'Content Moderation in Online Platforms: A Study of Annotation Methods for Inappropriate Language'. Together they form a unique fingerprint.

Cite this