Approaches to balancing data load of shared-nothing clusters and their performance comparison

7 Author(s)
Jiahong Wang ; Fac. of Software & Inf. Sci., Iwate Prefectural Univ., Japan ; Y. Tsutaya ; N. Segawa ; S. Yamane

Using a cluster of PCs, workstations, or similar machines (called nodes) to implement a database server brings two major benefits: high scalability and parallel processing capability. Before such a database server can be put into actual use, however, two problems have to be solved. The first is how to cope with data skew, since it can degrade system performance significantly. The second is how to connect a node to, or disconnect it from, the database server without affecting users. One general solution to both problems is to redistribute the data. Unfortunately, this would take the data offline for a long time, and numerous applications, such as those for reservations, finance, process control, hospitals, police, and armed forces, cannot afford offline data for any significant amount of time. We address the subject of balancing data load online, i.e., balancing data load concurrently with users' reading and writing of the database. The main contributions are an effective approach for this purpose and a comprehensive performance study of the possible alternatives.

Published in:

Parallel and Distributed Systems, 2002. Proceedings. Ninth International Conference on

Date of Conference:

17-20 Dec. 2002