Dynamic Load Balancing and Job Replication in a Global-Scale Grid Environment: A Comparison

Research output: Contribution to Journal › Article › Academic › peer-review

Abstract

Global-scale grids offer a massive source of processing power and thereby the means to support processor-intensive parallel applications. The strong burstiness and unpredictability of the available processing and network resources make it essential that applications be robust against the dynamics of grid environments. The two main techniques for coping with the dynamic nature of the grid are dynamic load balancing (DLB) and job replication (JR). In this paper, we analyze and compare the effectiveness of these two approaches by means of trace-driven simulations. We observe that there exists an easy-to-measure statistic Y and a corresponding threshold value Y*, such that DLB consistently outperforms JR when Y > Y*, whereas the reverse is true for Y < Y*. Based on this observation, we propose a simple and easy-to-implement approach, referred to throughout as the DLB/JR method, that decides dynamically whether to use DLB or JR. Extensive simulations based on a large set of real data monitored in a global-scale grid show that our DLB/JR method consistently performs at least as well as both DLB and JR in all circumstances, which makes it highly robust against the unpredictable nature of global-scale grids. © 2009, IEEE. All rights reserved.
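
The threshold rule stated in the abstract can be illustrated with a minimal sketch. The abstract does not define the statistic Y, give a value for Y*, or say what happens when Y equals Y*, so the names `Strategy` and `choose_strategy`, the numeric values, and the tie-breaking choice below are all assumptions made for illustration, not the authors' implementation.

```python
from enum import Enum


class Strategy(Enum):
    DLB = "dynamic load balancing"
    JR = "job replication"


def choose_strategy(y: float, y_star: float) -> Strategy:
    """Return DLB when the measured statistic exceeds the threshold, else JR.

    y      -- current value of the statistic Y (its definition is not given
              in the abstract; treated here as an opaque number)
    y_star -- threshold Y* (hypothetical value supplied by the caller)
    """
    # The abstract only covers Y > Y* and Y < Y*. Falling back to JR when
    # Y == Y* is an arbitrary choice made for this sketch.
    return Strategy.DLB if y > y_star else Strategy.JR


def choose_over_time(measurements, y_star):
    """Re-evaluate the decision for each new measurement of Y."""
    return [choose_strategy(y, y_star) for y in measurements]
```

Calling `choose_over_time([0.2, 0.9, 0.4], y_star=0.5)` with these made-up numbers would select JR, then DLB, then JR, which mirrors the idea of switching techniques as grid conditions change.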
Original language: English
Pages (from-to): 207-218
Journal: IEEE Transactions on Parallel and Distributed Systems
Volume: 20
Publication status: Published - 2009
