A performance study of big data workloads in cloud datacenters with network variability

Alexandru Uta, Harry Obaseki

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Public cloud computing platforms are a cost-effective solution for individuals and organizations to deploy various types of workloads, ranging from scientific applications, business-critical workloads, e-governance to big data applications. Co-locating all such different types of workloads in a single datacenter leads not only to performance degradation, but also to large degrees of performance variability, which is the result of virtualization, resource sharing and congestion. Many studies have already assessed and characterized the degree of resource variability in public clouds. However, we are missing a clear picture on how resource variability impacts big data workloads. In this work, we take a step towards characterizing the behavior of big data workloads under network bandwidth variability. Emulating real-world clouds’ bandwidth distribution, we characterize the performance achieved by running real-world big data applications. We find that most big data workloads are slowed down under network variability scenarios, even those that are not network-bound. Moreover, the maximum average slowdown for the cloud setup with highest variability is 1.48 for CPU-bound workloads, and 1.79 for network-bound workloads.

Original languageEnglish
Title of host publicationICPE 2018 - Companion of the 2018 ACM/SPEC International Conference on Performance Engineering
PublisherAssociation for Computing Machinery, Inc
Pages113-118
Number of pages6
Volume2018-January
ISBN (Electronic)9781450356299
ISBN (Print)9781450356299
DOIs
Publication statusPublished - 2 Apr 2018
Event9th ACM/SPEC International Conference on Performance Engineering, ICPE 2018 - Berlin, Germany
Duration: 9 Apr 201813 Apr 2018

Conference

Conference9th ACM/SPEC International Conference on Performance Engineering, ICPE 2018
Country/TerritoryGermany
CityBerlin
Period9/04/1813/04/18

Fingerprint

Dive into the research topics of 'A performance study of big data workloads in cloud datacenters with network variability'. Together they form a unique fingerprint.

Cite this