FastLanes: A Next-Gen File Format

Azim Afroozeh

Research output: PhD ThesisPhD-Thesis - Research and graduation internal

181 Downloads (Pure)

Abstract

This thesis presents the design and implementation of FastLanes, a next-generation file format for OLAP workloads on modern CPUs and GPUs. It redesigns lightweight encodings to be more data-parallel, fully exploiting SIMD and GPU parallelism, achieving decoding speeds of over 100 billion integers per second in scalar execution. It also introduces ALP (Adaptive Lossless Floating-Point Compression)—a novel, adaptive, and SIMD-friendly floating-point compressor that surpasses state-of-the-art methods such as ZSTD. GPU extensions of FastLanes adapt these techniques to thread-level parallelism, addressing shared-memory and warp-divergence bottlenecks to accelerate analytical workloads on GPU-based engines such as Crystal. The dissertation integrates these innovative encodings into a fully functional file format, also called FastLanes, which combines multiple lightweight codecs through Expression Encoding—a composable representation that merges encoding strategies (e.g., FOR, RLE, DICT) to achieve compression ratios comparable to heavyweight compressors while maintaining exceptional decompression speed. Furthermore, FastLanes is released as open-source software, ensuring reproducibility and enabling future research in hardware-optimized data storage and analytics.
Original languageEnglish
QualificationPhD
Awarding Institution
  • Vrije Universiteit Amsterdam
Supervisors/Advisors
  • Boncz, Peter, Supervisor
  • Mühleisen, H., Co-supervisor, -
Award date9 Jan 2026
DOIs
Publication statusPublished - 9 Jan 2026

Fingerprint

Dive into the research topics of 'FastLanes: A Next-Gen File Format'. Together they form a unique fingerprint.

Cite this