Loading [MathJax]/extensions/MathMenu.js
Sparsity Aware of TF-IDF Matrix to Accelerate Oblivious Document Ranking and Retrieval | IEEE Conference Publication | IEEE Xplore

Sparsity Aware of TF-IDF Matrix to Accelerate Oblivious Document Ranking and Retrieval


Abstract:

Due to cloud security concerns, there is an increasing interest in information retrieval systems that can support private queries over public documents. It is desirable f...Show More

Abstract:

Due to cloud security concerns, there is an increasing interest in information retrieval systems that can support private queries over public documents. It is desirable for oblivious document ranking and retrieval in public cloud at lower cost and faster speed without revealing query-related information. Currently, the term frequency-inverse document frequency (TF-IDF) and private information retrieval (PIR) techniques are used to solve this problem, but the encryption operation time is over dominant. Motivated by the observation of the sparsity of the TF-IDF matrix, we propose an efficient approach for oblivious document ranking and retrieval, called E-Coeus. It takes advantage of the high sparsity of the TF-IDF matrix to rearrange the matrix. Our method accelerates the speed of PIR inadvertently retrieving documents and reduces the user retrieval delay time. In a stand-alone experiment for a TF-IDF matrix of 1.2M rows and 64K columns with the sparsity of 10%, E-Coeus improves the document ranking and retrieval performance by 23% over the state-of-the-art approach, Coeus. With cluster of 64 machines, E-Coeus improves the performance by 34% over Coeus when the TF-IDF matrix sparsity is 30%.
Date of Conference: 01-03 November 2023
Date Added to IEEE Xplore: 29 May 2024
ISBN Information:

ISSN Information:

Conference Location: Exeter, United Kingdom

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.