Abstract
MapReduce ecosystems are (still) widely popular for big data processing in data centers. To address the diverse non-functional requirements arising from many and increasingly more sophisticated users, the community has developed many scheduling policies for MapReduce workloads. Although some individual policies can dynamically optimize for single and stable performance objectives, such as minimizing runtime or cost, or meeting deadlines for realtime-jobs, it seems unlikely that individual policies will remain competitive for increasingly more dynamic workloads and objectives. In contrast, in this work we investigate the ability to dynamically balance performance and cost of a portfolio scheduler for MapReduce workloads. To this end, we design and implement a portfolio scheduling technique, that is, a system capable of adapting to the current workload characteristics and target objectives by periodically evaluating its set of potential policies, and of switching to »the best» policy that targets the current system state. We implement and evaluate our system with real-world experiments on a workload containing a mixture of real-time and batch jobs, with the purpose of minimizing deadline violations, while keeping batch job slowdown in check. Our results show that POSUM is a promising alternative: it can out-perform the individual policies of its portfolio for the combined optimization goal, even without precise predictions.
Original language | English |
---|---|
Title of host publication | 2018 IEEE International Conference on Big Data (Big Data) 10-13 Dec. 2018 |
Subtitle of host publication | Proceedings |
Editors | Yang Song, Bing Liu, Kisung Lee, Naoki Abe, Calton Pu, Mu Qiao, Nesreen Ahmed, Donald Kossmann, Jeffrey Saltz, Jiliang Tang, Jingrui He, Huan Liu, Xiaohua Hu |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 351-357 |
Number of pages | 7 |
ISBN (Electronic) | 9781538650356 |
ISBN (Print) | 9781538650363 |
DOIs | |
Publication status | Published - 2019 |
Event | 2018 IEEE International Conference on Big Data, Big Data 2018 - Seattle, United States Duration: 10 Dec 2018 → 13 Dec 2018 |
Conference
Conference | 2018 IEEE International Conference on Big Data, Big Data 2018 |
---|---|
Country/Territory | United States |
City | Seattle |
Period | 10/12/18 → 13/12/18 |
Funding
Work supported by the projects Vidi MagnaData and COMMIT/.
Funders | Funder number |
---|---|
COMMIT |