Empirical study on the usage of graph query languages in open source Java projects

Philipp Seifer, Johannes Härtel, Martin Leinberger, Ralf Lämmel, Steffen Staab

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Graph data models are interesting in various domains, in part because of the intuitiveness and flexibility they offer compared to relational models. Specialized query languages, such as Cypher for property graphs or SPARQL for RDF, facilitate their use. In this paper, we present an empirical study on the usage of graph-based query languages in open-source Java projects on GitHub. We investigate the usage of SPARQL, Cypher, Gremlin and GraphQL in terms of popularity and their development over time. We select repositories based on dependencies related to these technologies and employ various popularity and source-code based filters and ranking features for a targeted selection of projects. For the concrete languages SPARQL and Cypher, we analyze the activity of repositories over time. For SPARQL, we investigate common application domains, query use and existence of ontological data modeling in applications that query for concrete instance data. Our results show, that the usage of graph query languages in open-source projects increased over the last years, with SPARQL and Cypher being by far the most popular. SPARQL projects are more active in terms of query related artifact changes and unique developers involved, but Cypher is catching up. Relatively few applications use SPARQL to query for concrete instance data: A majority of those applications employ multiple different ontologies, including project and domain specific ones. Common application domains are management systems and data visualization tools.
Original languageEnglish
Title of host publicationSLE 2019 - Proceedings of the 12th ACM SIGPLAN International Conference on Software Language Engineering, co-located with SPLASH 2019
EditorsO. Nierstrasz, J. Gray, B.C.d.S. Oliveira
PublisherAssociation for Computing Machinery, Inc
Pages152-166
ISBN (Electronic)9781450369817
DOIs
Publication statusPublished - 20 Oct 2019
Externally publishedYes
Event12th ACM SIGPLAN International Conference on Software Language Engineering, SLE 2019, as part of SPLASH 2019 - Athens, Greece
Duration: 20 Oct 201922 Oct 2019

Conference

Conference12th ACM SIGPLAN International Conference on Software Language Engineering, SLE 2019, as part of SPLASH 2019
Country/TerritoryGreece
CityAthens
Period20/10/1922/10/19

Funding

The authors gratefully acknowledge the financial support of project LISeQ (LA 2672/1-1) by the German Research Foundation (DFG).

FundersFunder number
Deutsche Forschungsgemeinschaft
German-Israeli Foundation for Scientific Research and Development

    Fingerprint

    Dive into the research topics of 'Empirical study on the usage of graph query languages in open source Java projects'. Together they form a unique fingerprint.

    Cite this