Accelerating Persistent Hash Indexes via Reducing Negative Searches | IEEE Conference Publication | IEEE Xplore

Accelerating Persistent Hash Indexes via Reducing Negative Searches


Abstract:

Hashing is a widely used and efficient indexing mechanism for key-value storage. Persistent memory (PM) has attracted extensive attention in research due to its non-volat...Show More

Abstract:

Hashing is a widely used and efficient indexing mechanism for key-value storage. Persistent memory (PM) has attracted extensive attention in research due to its non-volatility and DRAM-like performance. Intel DCPMM, as a PM, can provide large capacity and low total cost of ownership, further promoting the research of PM-based hash index. However, based on real-world workloads, we found that negative searches of existing PM-based hash indexes significantly degrade system performance. A direct method to solve this problem is to use a PM-based Bloom filter to reduce negative searches, but at the cost of the decreased lifespan of PM due to extra PM writes. An alternative method is to use a DRAM-based Bloom filter, but it still faces increased multi-threaded insertion/deletion/positive-search scalability overhead as well as increased data consistency and recovery overhead.In this paper, we propose SmartHT, a small-size DRAM-based Bloom filter to accelerate hash table operations for PM while solving the aforementioned problems. SmartHT uses efficient merge write optimization with head insertion, lazy deletion, and shortened average chained length of head-bucket to provide high insertion/deletion/positive-search scalability, respectively. On the other hand, it utilizes a merged-flush mechanism based on an 8-byte failure-atomic write method to reduce flush instructions and extra PM writes to achieve low data consistency overhead. Experimental results on Intel Optane DCPMM show that, compared with the state-of-the-art persistent hash indexes, SmartHT improves multi-threaded negative queries under uniform and skewed distributions by 4.61x-13.86x and 2.76x-12.99x respectively, achieves high multi-threaded scalability and low data consistency overhead, at the modest cost of recovery time overhead.
Date of Conference: 06-08 November 2023
Date Added to IEEE Xplore: 22 December 2023
ISBN Information:

ISSN Information:

Conference Location: Washington, DC, USA

Funding Agency:


I. Introduction

Persistent memory (PM) provides a new and promising path of building large-scale and low-latency memory and storage systems. PM, such as phase-change memory [1] and Intel Optane DC Persistent Memory Module (DCPMM) [2], offers desirable properties such as large capacity, high performance, non-volatility, and byte addressability. PM is expected to mix with DRAM as a hybrid memory system, bridging the performance gap between HDD and DRAM and enriching the storage layer. The DCPMM released by Intel has a single-device capacity of up to 256GB, and the maximum single-machine capacity is 8TB (512GB/DIMM * 16DIMMs), making it capable of providing a real-time processing system with high throughput, low latency, and low total cost of ownership (TCO) for a wide range of applications.

Contact IEEE to Subscribe

References

References is not available for this document.