Continuous skyline queries on multicore architectures

Tiziano De Matteis, Salvatore Di Girolamo, Gabriele Mencagli

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

The emergence of real-time decision-making applications in domains like high-frequency trading, emergency management, and service level analysis in communication networks has led to the definition of new classes of queries. Skyline queries are a notable example. Their results consist of all the tuples whose attribute vector is not dominated (in the Pareto sense) by one of any other tuple. Because of their popularity, skyline queries have been studied in terms of both sequential algorithms and parallel implementations for multiprocessors and clusters. Within the Data Stream Processing paradigm, traditional database queries on static relations have been revised in order to operate on continuous data streams. Most of the past papers propose sequential algorithms for continuous skyline queries, whereas there exist very few works targeting implementations on parallel machines. This paper contributes to fill this gap by proposing a parallel implementation for multicore architectures. We propose (i) a parallelization of the eager algorithm based on the notion of Skyline Influence Time, (ii) optimizations of the reduce phase and load-balancing strategies to achieve near-optimal speedup, and (iii) a set of experiments with both synthetic benchmarks and a real dataset in order to show our implementation effectiveness. Copyright © 2016 John Wiley & Sons, Ltd.
Original languageEnglish
Pages (from-to)3503-3522
JournalConcurrency and Computation: Practice and Experience
Volume28
Issue number12
DOIs
Publication statusPublished - 25 Aug 2016
Externally publishedYes

Fingerprint

Dive into the research topics of 'Continuous skyline queries on multicore architectures'. Together they form a unique fingerprint.

Cite this