Abstract
The Semantic Web community has produced a large body of literature that is becoming increasingly difficult to manage, browse, and use. Recent work on attention-based, sequence-to-sequence Transformer neural architectures has produced language models that generate surprisingly convincing synthetic conditional text samples. In this demonstration, we re-train the GPT-2 architecture on the complete corpus of proceedings of the International Semantic Web Conference from 2002 to 2019. We use user-provided sentences to conditionally sample paper snippets, thereby illustrating cases where this model can help address challenges in scientific paper writing, such as navigating extensive literature, explaining core Semantic Web concepts, providing definitions, and even inspiring new research ideas.
| Original language | English |
|---|---|
| Title of host publication | The Semantic Web |
| Subtitle of host publication | ESWC 2020 Satellite Events, Heraklion, Crete, Greece, May 31 – June 4, 2020, Revised Selected Papers |
| Editors | Andreas Harth, Valentina Presutti, Raphaël Troncy, Maribel Acosta, Axel Polleres, Javier D. Fernández, Josiane Xavier Parreira, Olaf Hartig, Katja Hose, Michael Cochez |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 158-163 |
| Number of pages | 6 |
| ISBN (Electronic) | 9783030623272 |
| ISBN (Print) | 9783030623265 |
| Publication status | Published - 2020 |
| Event | 17th Extended Semantic Web Conference, ESWC 2020 - Heraklion, Greece. Duration: 31 May 2020 → 4 Jun 2020 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 12124 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 17th Extended Semantic Web Conference, ESWC 2020 |
|---|---|
| Country/Territory | Greece |
| City | Heraklion |
| Period | 31/05/20 → 04/06/20 |
Funding
Acknowledgements. “This paper would not have been possible without the support of several persons and institutions. GPT-2 would like to thank” all of the members of the GPT-1 technical committee. The authors want to thank Frank van Harmelen, Paul Groth, and the anonymous reviewers for their valuable comments. This work is partly supported by the CLARIAH project funded by NWO.
Keywords
- Natural language generation
- Scholarly communication
- Semantic Web papers