Scheduled System Maintenance:
On Monday, April 27th, IEEE Xplore will undergo scheduled maintenance from 1:00 PM - 3:00 PM ET (17:00 - 19:00 UTC). No interruption in service is anticipated.
By Topic

A Dynamic Data Grid Replication Strategy to Minimize the Data Missed

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

The purchase and pricing options are temporarily unavailable. Please try again later.
3 Author(s)
Lei, M. ; Univ. of Alabama, Tuscaloosa ; Vrbsky, S.V. ; Xiaoyan Hong

The data availability in a data grid system is complicated by node failure, data catalog error and an unreliable network. To improve the job response time and data availability, data is typically replicated in large scale data-massive applications. However, the dynamic behavior of a Grid user makes it difficult to determine where and how to make data replications to meet the system availability goal. Some strategies for data replication have previously been proposed, but they assumed unlimited storage for replicas. In this paper, we present two new metrics to measure the system data availability. We then model the system availability problem assuming limited replica storage and transfer this to a classic optimal problem. We present four strategies for limited replica storage that maximize the data availability by minimizing the data missed rate (MinDmr), based on a file weight and prediction function. Our simulation on the OptorSim shows our MinDmr algorithm achieves better performance overall than others in term of data availability. Results indicate the performance of MinDmr is always better than others with varying prediction functions, job schedulers and file access patterns, as far as the data missing rate is concerned.

Published in:

Broadband Communications, Networks and Systems, 2006. BROADNETS 2006. 3rd International Conference on

Date of Conference:

1-5 Oct. 2006