Abstract
Image description datasets, such as Flickr30K and MS COCO, show a high degree of variation in the ways that crowd-workers talk about the world. Although this gives us a rich and diverse collection of data to work with, it also introduces uncertainty about how the world should be described. This paper shows the extent of this uncertainty in the PEOPLE domain. We present a taxonomy of different ways to talk about other people. This taxonomy serves as a reference point to think about how other people should be described, and can be used to classify and compute statistics about labels applied to people.
Original language | English |
---|---|
Title of host publication | Proceedings of the 11th International Conference on Natural Language Generation |
Place of Publication | Tilburg |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 415-420 |
Number of pages | 6 |
ISBN (Electronic) | 9781948087865 |
Publication status | Published - Nov 2018 |
Event | 11th International Natural Language Generation Conference, INLG 2018 - Tilburg, Netherlands. Duration: 5 Nov 2018 → 8 Nov 2018 |
Conference
Conference | 11th International Natural Language Generation Conference, INLG 2018 |
---|---|
Country/Territory | Netherlands |
City | Tilburg |
Period | 5/11/18 → 8/11/18 |
Funding
This paper was written while the first author was at the Vrije Universiteit Amsterdam, supported by the NWO Spinoza Prize awarded to Piek Vossen.
Funders | Funder number |
---|---|
Nederlandse Organisatie voor Wetenschappelijk Onderzoek | |