By Topic

Parallelization Analysis on Clusters of Multicore Nodes Using Shared and Distributed Memory Parallel Computing Models

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Tinetti, F.G. ; III-LIDI, Univ. Nac. de La Plata, La Plata, Argentina ; Wolfmann, G.

This paper presents alternatives and performance results obtained by analyzing parallelization on a cluster of multicore nodes. The ultimate goal is to show if both shared and distributed memory parallel processing models need to be taken into account independently, or if one affects the other and both must be considered simultaneosly. The application used as a testbed is classical in the context of high performance computing: matrix multiplication. Results are shown in terms of the conditions under which performance is optimized and where to focus the parallelization efforts on clusters with nodes with multiple cores, based on experiments combining both kinds of parallel models. In any case, all processing units should be effectively used in order to optimize the performance of parallel applications.

Published in:

Computer Science and Information Engineering, 2009 WRI World Congress on  (Volume:2 )

Date of Conference:

March 31 2009-April 2 2009