TY - JOUR
T1 - Evaluating unsupervised thesaurus-based labeling of audiovisual content in an archive production environment
AU - de Boer, V.
AU - Ordelman, Roeland J.
AU - Schuurman, Josefien
PY - 2016/9/1
Y1 - 2016/9/1
N2 - In this paper we report on a two-stage evaluation of unsupervised labeling of audiovisual content using collateral text data sources to investigate how such an approach can provide acceptable results for given requirements with respect to archival quality, authority and service levels to external users. We conclude that with parameter settings that are optimized using a rigorous evaluation of precision and accuracy, the quality of automatic term-suggestion is sufficiently high. We furthermore provide an analysis of the term extraction after being taken into production, where we focus on performance variation with respect to term types and television programs. Having implemented the procedure in our production work-flow allows us to gradually develop the system further and to also assess the effect of the transformation from manual to automatic annotation from an end-user perspective. Additional future work will be on deploying different information sources including annotations based on multimodal video analysis such as speaker recognition and computer vision.
AB - In this paper we report on a two-stage evaluation of unsupervised labeling of audiovisual content using collateral text data sources to investigate how such an approach can provide acceptable results for given requirements with respect to archival quality, authority and service levels to external users. We conclude that with parameter settings that are optimized using a rigorous evaluation of precision and accuracy, the quality of automatic term-suggestion is sufficiently high. We furthermore provide an analysis of the term extraction after being taken into production, where we focus on performance variation with respect to term types and television programs. Having implemented the procedure in our production work-flow allows us to gradually develop the system further and to also assess the effect of the transformation from manual to automatic annotation from an end-user perspective. Additional future work will be on deploying different information sources including annotations based on multimodal video analysis such as speaker recognition and computer vision.
KW - Audiovisual access
KW - Audiovisual archives
KW - Information extraction
KW - Practice-oriented evaluation
KW - Thesaurus
UR - http://www.scopus.com/inward/record.url?scp=84976331378&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84976331378&partnerID=8YFLogxK
U2 - 10.1007/s00799-016-0182-6
DO - 10.1007/s00799-016-0182-6
M3 - Article
SN - 1432-5012
VL - 17
SP - 189
EP - 201
JO - International Journal on Digital Libraries
JF - International Journal on Digital Libraries
IS - 3
ER -