Abstract
Data-driven interactive computation is widely used for business analytics, search-based decision-making, and log mining. These applications' short duration and bursty nature makes them a natural fit for serverless computing. Data processing serverless applications are composed of many small tasks. Application tasks that use remote storage encounter bottlenecks in the form of high latency, performance variability, and throttling. Caching has been used to mitigate this bottleneck for intermediate data. However, the use of caching for input data, albeit widely used in industry, has yet to be studied. We present the first performance study of scaling, a key feature of serverless computing, on serverless clusters with input data caches. We compare 8 task placement algorithms and quantify their impact on task slowdown and resource usage before and after scaling. We quantify the consequences of using work stealing. We quantify the performance impact of scaling in the buffer period immediately after scaling. We find up to a 420% increase in task slowdown after scaling without work stealing and a 22% slowdown with work stealing. We also find that cache misses after scaling can lead to an additional 21% resource usage.
Original language | English |
---|---|
Title of host publication | CF 2023 |
Subtitle of host publication | Proceedings of the 20th ACM International Conference on Computing Frontiers |
Publisher | Association for Computing Machinery, Inc |
Pages | 164-175 |
Number of pages | 12 |
ISBN (Electronic) | 9798400701405 |
DOIs | |
Publication status | Published - May 2023 |
Event | 20th ACM International Conference on Computing Frontiers, CF 2023 - Bologna, Italy Duration: 9 May 2023 → 11 May 2023 |
Conference
Conference | 20th ACM International Conference on Computing Frontiers, CF 2023 |
---|---|
Country/Territory | Italy |
City | Bologna |
Period | 9/05/23 → 11/05/23 |
Bibliographical note
Publisher Copyright:© 2023 Owner/Author.
Funding
Funders | Funder number |
---|---|
European Commission | 101093202 |
Keywords
- caching
- performance
- scheduling
- serverless