I. Introduction
Nowadays, big data analytics has become the focus of research in the various fields of government, manufacturing, healthcare, media, and science because of the growing popularity of internet of things (IoT). In fact, all of the industries have to confront the issues of big data analytics. The properties of big data include variety, volume, velocity and value. The “4Vs” is widely applied to the definition of big data [1]. Due to these properties, big data is not easy to be analyzed using a single computer. Many software framework developments with high scalability and fault tolerance, for example, Hadoop and Spark, are provided to handle massive data.