Grounding Toxicity in Real-World Events across Languages

Research output: Working paper / PreprintPreprintAcademic

27 Downloads (Pure)

Abstract

Social media conversations frequently suffer from toxicity, creating significant issues for users, moderators, and entire communities. Events in the real world, like elections or conflicts, can initiate and escalate toxic behavior online. Our study investigates how real-world events influence the origin and spread of toxicity in online discussions across various languages and regions. We gathered Reddit data comprising 4.5 million comments from 31 thousand posts in six different languages (Dutch, English, German, Arabic, Turkish and Spanish). We target fifteen major social and political world events that occurred between 2020 and 2023. We observe significant variations in toxicity, negative sentiment, and emotion expressions across different events and language communities, showing that toxicity is a complex phenomenon in which many different factors interact and still need to be investigated. We will release the data for further research along with our code.
Original languageEnglish
PublisherarXiv.org
Publication statusPublished - 22 May 2024

Bibliographical note

Paper accepted for at The 29th International Conference on Natural Language & Information Systems (NLDB 2024)

Keywords

  • cs.CL

Fingerprint

Dive into the research topics of 'Grounding Toxicity in Real-World Events across Languages'. Together they form a unique fingerprint.
  • Grounding Toxicity in Real-World Events Across Languages

    Tufa, W. T., Markov, I. & Vossen, P., 2024, Natural Language Processing and Information Systems: 29th International Conference on Applications of Natural Language to Information Systems, NLDB 2024, Turin, Italy, June 25–27, 2024, Proceedings, Part I. Rapp, A., Di Caro, L., Meziane, F. & Sugumaran, V. (eds.). Springer Science and Business Media Deutschland GmbH, Vol. 1. p. 197-210 14 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 14762 LNCS)(NLDB: International Conference on Applications of Natural Language to Information Systems).

    Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

    Open Access
    File
    7 Downloads (Pure)

Cite this