Abstract
Dynamic resource provisioning aims at maintaining the end-to-end response time of a web application within a pre-defined SLA. Although the topic has been well studied for monolithic applications, provisioning resources for applications composed of multiple services remains a challenge. When the SLA is violated, one must decide which service(s) should be reprovisioned for optimal effect. We propose to assign an SLA only to the front-end service. Other services are not given any particular response time objectives. Services are autonomously responsible for their own provisioning operations and collaboratively negotiate performance objectives with each other to decide the provisioning service(s). We demonstrate through extensive experiments that our system can add/remove/shift both servers and caches within an entire multi-service application under varying workloads to meet the SLA target and improve resource utilization.
Original language | English |
---|---|
Title of host publication | Proceedings of the 19th International World-Wide Web conference |
Place of Publication | New York, NY, USA |
Publisher | ACM |
Pages | 471-480 |
ISBN (Print) | 9781605587998 |
Publication status | Published - 2010 |
Event | 19th International World-Wide Web conference (WWW) - New York, NY, USA Duration: 1 Jan 2010 → 1 Jan 2010 |
Conference
Conference | 19th International World-Wide Web conference (WWW) |
---|---|
Period | 1/01/10 → 1/01/10 |