By Topic

Improving Performance of the Irregular Data Intensive Application with Small Computation Workload for CMPs

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

7 Author(s)
Gu Zhimin ; Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing, China ; Fu Yinxia ; Zheng Ninghan ; Zhang Jianxun
more authors

The data needs of scientific or commercial applications from a diverse range of fields have been increasing exponentially over the recent years. Although the traditional systems work well for computation that requires limited data handling, the CMPs in cloud computing may below performance for the computation that requires large amounts of intensive data. Conventional helper thread techniques try to improve the high performance overheads, but they can not improve performance of the irregular data intensive applications with small computation workload. Our goal is to provide a novel solution to improve the application performance in data intensive computing environments. By introducing the prepuce look ahead Size K, the prepush block size P and the synchronization block size B three operations to helper thread, we expect to reduce the overheads introduced by the traditional helper thread and leave the computing resources to perform useful prefetch work. As a starting point, we design the KPB interleaved data prepush algorithm, and use Q6600 and IBM 5110 multi-core computers as our test platforms to study behaviors of the benchmarks fromSPEC2006 suite and Olden suite. We construct the helper threads of mcf from SPEC2006, mst and em3d from Olden by using our method, the average result of speedup is 1.23, 1.32and 1.09 separately on the Q6600 machine, and 1.28, 1.35 and1.23 separately on another machine. Compared with the AP and PV methods, our method is less negative impact than both AP and PV, our KPB-method is also better than AP and PV in the prefetching timeliness and control ability.

Published in:

2011 40th International Conference on Parallel Processing Workshops

Date of Conference:

13-16 Sept. 2011