Abstract:
Analyzing large data sets in a short span of time has become increasingly important. Hadoop is one such framework, used to store and process large volumes of unstructured or semi-structured data in a distributed manner. The main theme of this paper is the analysis of clickstream data gathered from an online retail e-commerce website using the Hadoop framework. In this process, we use tools such as Pig, Hive, and Sqoop, which work on top of the MapReduce model, to process big data efficiently. The insight-finding mechanism produces day-wise sales reports, hourly sales reports, and top-selling item reports from the clickstream dataset. Finally, the output visualization plots provide detailed insights into the processed clickstream data.
Published in: 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT)
Date of Conference: 20-21 April 2018
Date Added to IEEE Xplore: 27 September 2018