Automatically extracting class diagrams from spreadsheets

Felienne Hermans*, Martin Pinzger, Arie Van Deursen

*Corresponding author for this work

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review


The use of spreadsheets to capture information is widespread in industry. Spreadsheets can thus be a wealthy source of domain information. We propose to automatically extract this information and transform it into class diagrams. The resulting class diagram can be used by software engineers to understand, refine, or re-implement the spreadsheet's functionality. To enable the transformation into class diagrams we create a library of common spreadsheet usage patterns. These patterns are localized in the spreadsheet using a two- dimensional parsing algorithm. The resulting parse tree is transformed and enriched with information from the library. We evaluate our approach on the spreadsheets from the Euses Spreadsheet Corpus by comparing a subset of the generated class diagrams with reference class diagrams created manually.

Original languageEnglish
Title of host publicationECOOP 2010 - Object-Oriented Programming - 24th European Conference, Proceedings
Number of pages24
Publication statusPublished - 3 Aug 2010
Externally publishedYes
Event24th European Conference on Object-Oriented Programming, ECOOP 2010 - Maribor, Slovenia
Duration: 21 Jun 201025 Jun 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6183 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Conference24th European Conference on Object-Oriented Programming, ECOOP 2010


Dive into the research topics of 'Automatically extracting class diagrams from spreadsheets'. Together they form a unique fingerprint.

Cite this