Skip to Main Content
Power-saving storage in data centers is now gaining much interest because of the increase in power requirements. In particular, with the expansion of services requiring distributed-processing frameworks, power proportionality in power-aware file systems is attracting great attention from academia and industry. The concept of power proportionality is that a system should perform work in proportion to the energy it consumes. To provide this important characteristic, data placement methods that enable a system to operate in multiple gears, each containing a different number of active nodes, have been proposed. Of these, RABBIT, with its power proportional data placement, is a novel method that is implemented over a Hadoop Distributed File System (HDFS) and has been shown to be successful in guaranteeing power proportionality for read only tasks. However, the data layout used in RABBIT does not consider write access occurring when the system operates in low gear and there are inactive nodes. In this paper, to discover an appropriate approach for power-efficient distributed file systems, we evaluate the performance of the RABBIT and PARAID methods on read-only tasks and the cost to system performance of write access in low gear. The skewed data placement in PARAID, which was one of the first methods to suggest power proportionality in disk-based storage systems, is supposed to offer good performance when dealing with a frequently updated dataset.