Analysis of resource usage profile for MapReduce applications using Hadoop on cloud

2 Author(s)
Zheyuan Liu ; Control & Network Inst., Northwestern Polytech. Univ., Xi'an, China ; Dejun Mu

In this paper we present a study of resource consumption profiles for MapReduce applications running on Hadoop on Amazon EC2. We selected three applications, Grep, Word Count and Sort, and measured their resource usage in terms of CPU and memory footprint while altering Hadoop's configuration parameters corresponding to the I/O buffer. Our study brings up three key points: first, the effect of the I/O parameters on the total running time of an application; second, the invalidity of the Hadoop scheduler's assumption that the three phases (copy, sort and reduce) of a Reduce task are equal; and finally, an insight, supported by the experimental results, into ways to improve the Hadoop scheduler for running multiple jobs by capturing the resource consumption information of different applications. To the best of our knowledge, this is the first work to present such a resource usage study.
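
The scheduler assumption mentioned in the abstract can be made concrete with a small sketch. One reading of "the three phases are equal" is that copy, sort and reduce each contribute one third to a Reduce task's progress estimate. The Python below compares that estimate with one weighted by per-phase durations. The function names and all timing values are hypothetical, chosen only for illustration; they are neither measurements from the paper nor part of the Hadoop API.

```python
# Hypothetical illustration: names and numbers are assumptions,
# not taken from the paper or from Hadoop's source code.
PHASES = ("copy", "sort", "reduce")


def reduce_progress_equal_weights(phase_fractions):
    """Overall progress when every phase counts for one third of the task."""
    return sum(phase_fractions[p] for p in PHASES) / len(PHASES)


def reduce_progress_time_weighted(phase_fractions, phase_seconds):
    """Overall progress when each phase is weighted by its duration."""
    total = sum(phase_seconds[p] for p in PHASES)
    return sum(phase_fractions[p] * phase_seconds[p] for p in PHASES) / total


# Made-up scenario: the copy phase has finished, the other two have not started.
fractions = {"copy": 1.0, "sort": 0.0, "reduce": 0.0}
# Made-up phase durations in seconds, with copy dominating the task.
seconds = {"copy": 60.0, "sort": 10.0, "reduce": 30.0}

equal = reduce_progress_equal_weights(fractions)
weighted = reduce_progress_time_weighted(fractions, seconds)
print(f"equal-weight estimate:  {equal:.2f}")
print(f"time-weighted estimate: {weighted:.2f}")
```

With these invented durations the equal-weight estimate understates how far the task has actually progressed, which is the kind of mismatch a scheduler could reduce by recording per-application phase profiles.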

Published in:

Quality, Reliability, Risk, Maintenance, and Safety Engineering (ICQR2MSE), 2012 International Conference on

Date of Conference:

15-18 June 2012