Skip to main navigation Skip to search Skip to main content

Columbo: A Reasoning Framework for Kubernetes' Configuration Space

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Resource managers such as Kubernetes are rapidly evolving to support low-latency and scalable computing paradigms such as serverless and granular computing. As a result, Kubernetes supports dozens of workload deployment models and exposes roughly 1,600 configuration parameters. Previous work has shown that parameter tuning can significantly improve Kubernetes' performance, but identifying which parameters impact performance and should be tuned remains challenging. To help users optimize their Kubernetes deployments, we present Columbo, an offline reasoning framework to detect and resolve performance bottlenecks using configuration parameters. We study Kubernetes and define its workload deployment pipeline of 6 stages and 26 steps. To detect bottlenecks, Columbo uses an analytical model to predict the best-case deployment time of a workload per pipeline stage and compares it to empirical data from a novel benchmark suite. Columbo then uses a rule-based methodology to recommend parameter updates based on the detected bottleneck, deployed workload, and mapping of configurations to pipeline stages. We demonstrate that Columbo reduces workload deployment time across its benchmark suite by 28% on average and 79% at most. We report a total execution time decrease of 17% for data processing with Spark and up to 20% for serverless workflows with OpenWhisk. Columbo is open-source and available at https://github.com/atlarge-research/continuum/tree/columbo.

Original languageEnglish
Title of host publicationICPE '25
Subtitle of host publicationProceedings of the 16th ACM/SPEC International Conference on Performance Engineering
PublisherAssociation for Computing Machinery, Inc
Pages45-57
Number of pages13
ISBN (Electronic)9798400710735
DOIs
Publication statusPublished - 2025
Event16th ACM/SPEC International Conference on Performance, ICPE 2025 - Toronto, Canada
Duration: 5 May 20259 May 2025

Conference

Conference16th ACM/SPEC International Conference on Performance, ICPE 2025
Country/TerritoryCanada
CityToronto
Period5/05/259/05/25

Bibliographical note

Publisher Copyright:
© 2025 Copyright held by the owner/author(s).

Keywords

  • configuration tuning
  • kubernetes
  • resource management

Fingerprint

Dive into the research topics of 'Columbo: A Reasoning Framework for Kubernetes' Configuration Space'. Together they form a unique fingerprint.

Cite this