Projects per year
Abstract
We apply computational stylometric techniques to an 18th century Dutch chronicle to determine which fragments of the manuscript represent the author’s own original work and which show signs of external source use through either direct copying or paraphrasing. Through stylometric methods the majority of text fragments in the chronicle can be correctly labelled as either the author’s own words, direct copies from sources or paraphrasing. Our results show that clustering text fragments based on stylometric measures is an effective methodology for authorship verification of this document; however, this approach is less effective when personal writing style is masked by author independent styles or when applied to paraphrased text.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022) |
| Place of Publication | Marseille |
| Publisher | European Language Resources Association (ELRA) |
| Pages | 5865-78 |
| Number of pages | 14 |
| Publication status | Published - 2022 |
Fingerprint
Dive into the research topics of 'Identifying Copied Fragments in an 18th Century Dutch Chronicle'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Chronicling Novelty. New knowledge in the Netherlands, 1500-1850.
Kuijpers, E. (Principal Investigator), Morante Vallejo, R. (Project Researcher) & Romein, A. (Project Researcher)
1/01/19 → 31/03/24
Project: Research
Datasets
-
chroniclingnovelty/stylometry-lrec22: Data and code of the LREC 22 publication Identifying Copied Fragments in an 18th Century Dutch Chronicle
ESmith368 (Creator), rmorante (Creator), Kuijpers, E. (Creator), Lassche, A. (Creator) & Wilhelmus, L. (Creator), Zenodo, 16 Aug 2024
DOI: 10.5281/zenodo.13332803, https://zenodo.org/records/13332803
Dataset / Software: Dataset