Abstract
Large-scale survey tools enable the collection of citizen feedback in opinion corpora. Extracting the key arguments from a large and noisy set of opinions helps in understanding the opinions quickly and accurately. Fully automated methods can extract arguments but (1) require large labeled datasets that induce large annotation costs and (2) work well for known viewpoints, but not for novel points of view. We propose HyEnA, a hybrid (human + AI) method for extracting arguments from opinionated texts, combining the speed of automated processing with the understanding and reasoning capabilities of humans. We evaluate HyEnA on three citizen feedback corpora. We find that, on the one hand, HyEnA achieves higher coverage and precision than a state-of-The-Art automated method when compared to a common set of diverse opinions, justifying the need for human insight. On the other hand, HyEnA requires less human effort and does not compromise quality compared to (fully manual) expert analysis, demonstrating the benefit of combining human and artificial intelligence.
Original language | English |
---|---|
Pages (from-to) | 1187-1222 |
Number of pages | 36 |
Journal | Journal of Artificial Intelligence Research |
Volume | 80 |
Early online date | 31 Jul 2024 |
DOIs | |
Publication status | Published - 2024 |
Bibliographical note
Publisher Copyright:© 2024 The Authors.
Funding
Acknowledgements This research was (partially) funded by the Hybrid Intelligence Center, a 10-year program funded by the Dutch Ministry of Education, Culture, and Science through the Netherlands Organisation for Scientific Research (NWO). We would like to thank the anonymous reviewers whose insightful comments and suggestions helped improve the paper.
Funders | Funder number |
---|---|
Dutch Ministry of Education, Culture, and Science | |
Nederlandse Organisatie voor Wetenschappelijk Onderzoek |