Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history

Aysenur Bilgin, Erik Tjong Kim Sang, Kim Smeenk, Laura Hollink, Jacco Van Ossenbruggen, Frank Harbers, Marcel Broersma

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

With the growing abundance of unlabeled data in real-world tasks, researchers have to rely on the predictions given by black-boxed computational models. However, it is an often neglected fact that these models may be scoring high on accuracy for the wrong reasons. In this paper, we present a practical impact analysis of enabling model transparency by various presentation forms. For this purpose, we developed an environment that empowers non-computer scientists to become practicing data scientists in their own research field. We demonstrate the gradually increasing understanding of journalism historians through a real-world use case study on automatic genre classification of newspaper articles. This study is a first step towards trusted usage of machine learning pipelines in a responsible way.

Original languageEnglish
Title of host publicationProceedings - IEEE 14th International Conference on eScience, e-Science 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages486-496
Number of pages11
ISBN (Electronic)9781538691564
DOIs
Publication statusPublished - 24 Dec 2018
Externally publishedYes
Event14th IEEE International Conference on eScience, e-Science 2018 - Amsterdam, Netherlands
Duration: 29 Oct 20181 Nov 2018

Publication series

NameProceedings - IEEE 14th International Conference on eScience, e-Science 2018

Conference

Conference14th IEEE International Conference on eScience, e-Science 2018
Country/TerritoryNetherlands
CityAmsterdam
Period29/10/181/11/18

Keywords

  • Genre classification
  • Journalism history
  • Machine learning
  • Transparency

Fingerprint

Dive into the research topics of 'Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history'. Together they form a unique fingerprint.

Cite this