Efficient and Privacy-Preserving Feature Selection Based on Multiparty Computation | IEEE Journals & Magazine | IEEE Xplore

Efficient and Privacy-Preserving Feature Selection Based on Multiparty Computation


Abstract:

Feature selection is a critical data preprocessing stage that has been proven beneficial in data mining and machine learning applications. As most current works focus on ...Show More

Abstract:

Feature selection is a critical data preprocessing stage that has been proven beneficial in data mining and machine learning applications. As most current works focus on privacy during the training and inference tasks in machine learning, implementing privacy preservation in preprocessing is a powerful complement. In this paper, we present an efficient and privacy-preserving feature selection protocol (EPFS) based on secure multiparty computation (MPC). We customize a novel method called approximate fixed-point representation to reduce the bitwidth of the sample distribution probability, thereby decreasing communication overhead. We optimize the comparison protocol by reducing the high-order bits of values according to the characteristics of the datasets and design the feature score calculation protocol together with several other MPC-based sub-protocols. We also construct an efficient feature selection workflow to obtain the reduced feature matrix, which avoids the numerous calls of secure comparison and equality test protocols in loops. Experiments on several real-world datasets show that the improved comparison protocol achieves a 29%-53% improvement in runtime and a 6%-32% reduction in communication compared to the general comparison protocol. The optimized feature selection workflow exhibits an upper performance bound, achieving a 38% improvement in runtime compared to prior work. Besides, we implement secure logistic regression training based on the selection features, where the accuracy has improved by an average of 8% compared to training on raw features.
Page(s): 3505 - 3518
Date of Publication: 05 March 2025

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.