Building Cross-language Corpora for Human Understanding of Privacy Policies

Francesco Ciclosi*, Silvia Vidor, Fabio Massacci

*Corresponding author for this work

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

5 Downloads (Pure)

Abstract

Making sure that users understand privacy policies that impact them is a key challenge for a real GDPR deployment. Research studies are mostly carried in English, but in Europe and elsewhere, users speak a language that is not English. Replicating studies in different languages requires the availability of comparable cross-language privacy policies corpora. This work provides a methodology for building comparable cross-language in a national language and a reference study language. We provide an application example of our methodology comparing English and Italian extending the corpus of one of the first studies about users understanding of technical terms in privacy policies. We also investigate other open issues that can make replication harder.

Original languageEnglish
Title of host publicationDigital Sovereignty in Cyber Security
Subtitle of host publicationFirst International Workshop, CyberSec4Europe 2022, Venice, Italy, April 17–21, 2022, Revised Selected Papers
EditorsAntonio Skarmeta, Sara Matheu, Antonio Lioy, Daniele Canavese
PublisherSpringer Science and Business Media Deutschland GmbH
Pages113-131
Number of pages19
ISBN (Electronic)9783031360961
ISBN (Print)9783031360954
DOIs
Publication statusPublished - 2023
Event1st International Workshop on Digital Sovereignty in Cyber Security: New Challenges in Future Vision, CyberSec4Europe 2022 - Venice, Italy
Duration: 17 Apr 202221 Apr 2022

Publication series

NameCommunications in Computer and Information Science (CCIS)
PublisherSpringer
Volume1807
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference1st International Workshop on Digital Sovereignty in Cyber Security: New Challenges in Future Vision, CyberSec4Europe 2022
Country/TerritoryItaly
CityVenice
Period17/04/2221/04/22

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.

Funding

The authors would like to thank Eleanor Birrell and Ada Lerner for providing us their raw privacy corpus used in their paper [24]. Without their time and expertise this paper would not have been possible. This work was supported in part by the EU under the H2020 Leadership in Enabling and Industrial Technologies program under grant agreement 830929 (CyberSec4Europe). Acknowledgement. The authors would like to thank Eleanor Birrell and Ada Lerner for providing us their raw privacy corpus used in their paper [24]. Without their time and expertise this paper would not have been possible. This work was supported in part by the EU under the H2020 Leadership in Enabling and Industrial Technologies program under grant agreement 830929 (CyberSec4Europe).

FundersFunder number
Eleanor Birrell and Ada Lerner
European Commission830929
European Commission

    Keywords

    • Comparable corpora
    • Cross-language corpora
    • Evaluation
    • Methodology
    • Privacy Policies

    Fingerprint

    Dive into the research topics of 'Building Cross-language Corpora for Human Understanding of Privacy Policies'. Together they form a unique fingerprint.

    Cite this