Language-agnostic integrated queries in a managed polyglot runtime

Filippo Schiavio, Daniele Bonetta, Walter Binder

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Language-integrated query (LINQ) frameworks offer a convenient programming abstraction for processing in-memory collections of data, allowing developers to concisely express declarative queries using general-purpose programming languages. Existing LINQ frameworks rely on the well-defined type system of statically-typed languages such as C♯ or Java to perform query compilation and execution. As a consequence of this design, they do not support dynamic languages such as Python, R, or JavaScript. Such languages are however very popular among data scientists, who would certainly benefit from LINQ frameworks in data analytics applications. In this work we bridge the gap between dynamic languages and LINQ frameworks. We introduce DynQ, a novel query engine designed for dynamic languages. DynQ is language-agnostic, since it is able to execute SQL queries in a polyglot language runtime. Moreover, DynQ can execute queries combining data from multiple sources, namely in-memory object collections as well as on-file data and external database systems. Our evaluation of DynQ shows performance comparable with equivalent hand-optimized code, and in line with common data-processing libraries and embedded databases, making DynQ an appealing query engine for standalone analytics applications and for data-intensive server-side workloads.
Original languageEnglish
Title of host publicationProceedings of the VLDB Endowment
PublisherVLDB Endowment
Pages1414
Volume14
DOIs
Publication statusPublished - 2021
Externally publishedYes
Event47th International Conference on Very Large Data Bases, VLDB 2021 - Virtual, Online
Duration: 16 Aug 202120 Aug 2021

Publication series

NameProceedings of the VLDB Endowment
ISSN (Electronic)2150-8097

Conference

Conference47th International Conference on Very Large Data Bases, VLDB 2021
CityVirtual, Online
Period16/08/2120/08/21

Funding

The work presented in this paper has been supported by Oracle (ERO project 1332). We thank the VLDB reviewers for their detailed feedback and the VM Research Group at Oracle Labs for their support. Oracle, Java, and HotSpot are trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.

FundersFunder number
VM Research Group at Oracle Labs

    Fingerprint

    Dive into the research topics of 'Language-agnostic integrated queries in a managed polyglot runtime'. Together they form a unique fingerprint.

    Cite this