Analyzing BigData with Hadoop cluster in HDInsight azure Cloud | IEEE Conference Publication | IEEE Xplore

Analyzing BigData with Hadoop cluster in HDInsight azure Cloud


Abstract:

Recently Cloud based Hadoop has gained a lot of interest that offer ready to use Hadoop cluster environment for processing of Big Data, eliminating the operational challe...Show More

Abstract:

Recently Cloud based Hadoop has gained a lot of interest that offer ready to use Hadoop cluster environment for processing of Big Data, eliminating the operational challenges of on-site hardware investment, IT support, and installing, configuring of Hadoop components such as HDFS and MapReduce. On demand Hadoop as a service helps the industries to focus on business growth and based on pay per use model for Big Data processing with auto-scaling of Hadoop cluster feature. In this paper implementation of various MapReduce jobs like Pi, TeraSort, WordCount has been done on cloud based Hadoop deployment by using Microsoft Azure cloud services. Performance of MapReduce jobs has been evaluated with respect to CPU execution time with varying size of Hadoop cluster. From the experimental result, it is found that CPU execution time to finish the jobs decrease as the number of Data Nodes in HDInsight cluster increases and indicates the good response time with increase in performance as well as more customer satisfaction.
Date of Conference: 17-20 December 2015
Date Added to IEEE Xplore: 31 March 2016
ISBN Information:
Electronic ISSN: 2325-9418
Conference Location: New Delhi, India

Contact IEEE to Subscribe

References

References is not available for this document.