Skip to Main Content
With the increasing popularity of cloud computing, Hadoop has become a widely used open source cloud computing framework for large scale data processing. However, few studies have been done to enhance data confidentiality of Hadoop against storage servers. In this paper, we address the data confidentiality issue by integrating hybrid encryption schemes and the Hadoop distributed file system (HDFS). We propose and implement two integrations, HDFS-RSA and HDFS-Pairing, as extensions of HDFS. Experiments are conducted to demonstrate the performance overhead of HDFS-RSA and HDFS-Pairing. Our integrations provide alternatives toward achieving data confidentiality for Hadoop.