Robust Text Classification: Analyzing Prototype-Based Networks

Zhivar Sourati, Darshan Deshpande, Filip Ilievski, Kiril Gashteovski, Sascha Saralajew

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Downstream applications often require text classification models to be accurate and robust. While the accuracy of state-of-the-art Language Models (LMs) approximates human performance, they often exhibit a drop in performance on real-world noisy data. This lack of robustness can be concerning, as even small perturbations in text, irrelevant to the target task, can cause classifiers to incorrectly change their predictions. A potential solution can be the family of Prototype-Based Networks (PBNs) that classifies examples based on their similarity to prototypical examples of a class (prototypes) and has been shown to be robust to noise for computer vision tasks. In this paper, we study whether the robustness properties of PBNs transfer to text classification tasks under both targeted and static adversarial attack settings. Our results show that PBNs, as a mere architectural variation of vanilla LMs, offer more robustness compared to vanilla LMs under both targeted and static settings. We showcase how PBNs' interpretability can help us understand PBNs' robustness properties. Finally, our ablation studies reveal the sensitivity of PBNs' robustness to the strictness of clustering and the number of prototypes in the training phase, as tighter clustering and a low number of prototypes result in less robust PBNs.

Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics: EMNLP 2024
Subtitle of host publication[Proceedings]
EditorsYaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
PublisherAssociation for Computational Linguistics (ACL)
Pages12736-12757
Number of pages22
ISBN (Electronic)9798891761681
DOIs
Publication statusPublished - 2024
Event2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024 - Hybrid, Miami, United States
Duration: 12 Nov 202416 Nov 2024

Conference

Conference2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024
Country/TerritoryUnited States
CityHybrid, Miami
Period12/11/2416/11/24

Bibliographical note

Publisher Copyright:
© 2024 Association for Computational Linguistics.

Funding

This research was supported, in part, by the Army Research Laboratory under contract W911NF-23-2-0183, and by the National Science Foundation under Contract No. IIS-2153546.

FundersFunder number
Army Research LaboratoryW911NF-23-2-0183
National Science FoundationIIS-2153546

    Fingerprint

    Dive into the research topics of 'Robust Text Classification: Analyzing Prototype-Based Networks'. Together they form a unique fingerprint.

    Cite this