Cross-linguistic differences and similarities in image descriptions

C.W.J. van Miltenburg, Desmond Elliott, P.T.J.M. Vossen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Automatic image description systems are commonly trained and evaluated on large image description datasets. Recently, researchers have started to collect such datasets for languages other than English. An unexplored question is how different these datasets are from English and, if there are any differences, what causes them to differ. This paper provides a cross-linguistic comparison of Dutch, English, and German image descriptions. We find that these descriptions are similar in many respects, but the familiarity of crowd workers with the subjects of the images has a noticeable influence on the specificity of the descriptions.
LanguageEnglish
Title of host publicationProceedings of the 10th International Conference on Natural Language Generation
Place of PublicationSantiago de Compostela, Spain
PublisherAssociation for Computational Linguistics
Pages21-30
Number of pages10
StatePublished - 1 Sep 2017

Fingerprint

Linguistics

Cite this

van Miltenburg, C. W. J., Elliott, D., & Vossen, P. T. J. M. (2017). Cross-linguistic differences and similarities in image descriptions. In Proceedings of the 10th International Conference on Natural Language Generation (pp. 21-30). Santiago de Compostela, Spain: Association for Computational Linguistics.
van Miltenburg, C.W.J. ; Elliott, Desmond ; Vossen, P.T.J.M./ Cross-linguistic differences and similarities in image descriptions. Proceedings of the 10th International Conference on Natural Language Generation. Santiago de Compostela, Spain : Association for Computational Linguistics, 2017. pp. 21-30
@inproceedings{8405b16a7d65419e84aea7c38e5f8e34,
title = "Cross-linguistic differences and similarities in image descriptions",
abstract = "Automatic image description systems are commonly trained and evaluated on large image description datasets. Recently, researchers have started to collect such datasets for languages other than English. An unexplored question is how different these datasets are from English and, if there are any differences, what causes them to differ. This paper provides a cross-linguistic comparison of Dutch, English, and German image descriptions. We find that these descriptions are similar in many respects, but the familiarity of crowd workers with the subjects of the images has a noticeable influence on the specificity of the descriptions.",
author = "{van Miltenburg}, C.W.J. and Desmond Elliott and P.T.J.M. Vossen",
year = "2017",
month = "9",
day = "1",
language = "English",
pages = "21--30",
booktitle = "Proceedings of the 10th International Conference on Natural Language Generation",
publisher = "Association for Computational Linguistics",

}

van Miltenburg, CWJ, Elliott, D & Vossen, PTJM 2017, Cross-linguistic differences and similarities in image descriptions. in Proceedings of the 10th International Conference on Natural Language Generation. Association for Computational Linguistics, Santiago de Compostela, Spain, pp. 21-30.

Cross-linguistic differences and similarities in image descriptions. / van Miltenburg, C.W.J.; Elliott, Desmond; Vossen, P.T.J.M.

Proceedings of the 10th International Conference on Natural Language Generation. Santiago de Compostela, Spain : Association for Computational Linguistics, 2017. p. 21-30.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Cross-linguistic differences and similarities in image descriptions

AU - van Miltenburg,C.W.J.

AU - Elliott,Desmond

AU - Vossen,P.T.J.M.

PY - 2017/9/1

Y1 - 2017/9/1

N2 - Automatic image description systems are commonly trained and evaluated on large image description datasets. Recently, researchers have started to collect such datasets for languages other than English. An unexplored question is how different these datasets are from English and, if there are any differences, what causes them to differ. This paper provides a cross-linguistic comparison of Dutch, English, and German image descriptions. We find that these descriptions are similar in many respects, but the familiarity of crowd workers with the subjects of the images has a noticeable influence on the specificity of the descriptions.

AB - Automatic image description systems are commonly trained and evaluated on large image description datasets. Recently, researchers have started to collect such datasets for languages other than English. An unexplored question is how different these datasets are from English and, if there are any differences, what causes them to differ. This paper provides a cross-linguistic comparison of Dutch, English, and German image descriptions. We find that these descriptions are similar in many respects, but the familiarity of crowd workers with the subjects of the images has a noticeable influence on the specificity of the descriptions.

M3 - Conference contribution

SP - 21

EP - 30

BT - Proceedings of the 10th International Conference on Natural Language Generation

PB - Association for Computational Linguistics

CY - Santiago de Compostela, Spain

ER -

van Miltenburg CWJ, Elliott D, Vossen PTJM. Cross-linguistic differences and similarities in image descriptions. In Proceedings of the 10th International Conference on Natural Language Generation. Santiago de Compostela, Spain: Association for Computational Linguistics. 2017. p. 21-30.