Loading web-font TeX/Main/Regular
HV-SNSP: A Low-Overhead Data Recovery Method Based on Cross-Checking | IEEE Journals & Magazine | IEEE Xplore

HV-SNSP: A Low-Overhead Data Recovery Method Based on Cross-Checking


The recovery time of HV-SNSP is reduced in proportion compared with other methods when there are multiple nodes in each rack

Abstract:

The failure of a single unreliable commodity component is very common in large-scale distributed storage systems. In order to ensure the reliability of data in large-scal...Show More

Abstract:

The failure of a single unreliable commodity component is very common in large-scale distributed storage systems. In order to ensure the reliability of data in large-scale distributed storage systems, many studies have emerged one after another. Among them, Erasure Codes are widely used in actual storage systems, such as Hadoop Distributed File System (HDFS), which provides high fault-tolerance with lower storage overhead. However, usually the recovery of erasure-coded storage system when encountering node failure results in severe cross-node and cross-rack bandwidth consumption, which affects the efficiency of failure recovery and wastes additional resources. In this paper, we improve the erasure coding storage strategy in distributed storage systems, and propose a low-overhead data recovery method based on cross-checking, namely HV-SNSP. In HV-SNSP, horizontal and vertical cross parity checking is realized by adding RS parity inside the data node, that is, \text{H}^{\mathrm {RS(n, k)}} - \text{V}^{\mathrm {RS(n', \textrm {}k')}} storage architecture. Based on \text{H}^{\mathrm {RS(n, \textrm {}k)}} - \text{V}^{\mathrm {RS(n', k')}} , a low-cost supply node selection strategy, namely SNSP, is designed, and nodes with shorter network distance and lower load are selected to participate in recovery. This strategy can effectively reduce the amount of data transmission, shorten the recovery time, and improve the recovery efficiency. The experimental results show that compared with traditional RS, HV-SNSP can reduce the amount of cross-rack data transmission by 62.5% during data recovery, and can shorten the recovery time by up to 42.41%; Compared with D3, HV-SNSP can reduce the occupation of cross-rack bandwidth by 25% and shorten the recovery time by 36.58%.
The recovery time of HV-SNSP is reduced in proportion compared with other methods when there are multiple nodes in each rack
Published in: IEEE Access ( Volume: 11)
Page(s): 5737 - 5745
Date of Publication: 11 January 2023
Electronic ISSN: 2169-3536

Funding Agency:


References

References is not available for this document.