Conferences >2016 49th Annual IEEE/ACM Int...

Continuous runahead: Transparent hardware acceleration for memory intensive workloads

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Runahead execution pre-executes the application's own code to generate new cache misses. This pre-execution results in prefetch requests that are overwhelmingly accurate ...Show More

Metadata

Abstract:

Runahead execution pre-executes the application's own code to generate new cache misses. This pre-execution results in prefetch requests that are overwhelmingly accurate (95% in a realistic system configuration for the memory intensive SPEC CPU2006 benchmarks), much more so than a global history buffer (GHB) or stream prefetcher (by 13%/19%). However, we also find that current runahead techniques are very limited in coverage: they prefetch only a small fraction (13%) of all runahead-reachable cache misses. This is because runahead intervals are short and limited by the duration of each full-window stall. In this work, we explore removing the constraints that lead to these short intervals. We dynamically filter the instruction stream to identify the chains of operations that cause the pipeline to stall. These operations are renamed to execute speculatively in a loop and are then migrated to a Continuous Runahead Engine (CRE), a shared multi-core accelerator located at the memory controller. The CRE runs ahead with the chain continuously, increasing prefetch coverage to 70% of runahead-reachable cache misses. The result is a 43.3% weighted speedup gain on a set of memory intensive quad-core workloads and a significant reduction in system energy consumption. This is a 21.9% performance gain over the Runahead Buffer, a state-of-the-art runahead proposal and a 13.2%/13.5% gain over GHB/stream prefetching. When the CRE is combined with GHB prefetching, we observe a 23.5% gain over a baseline with GHB prefetching alone.

Published in: 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

Date of Conference: 15-19 October 2016

Date Added to IEEE Xplore: 15 December 2016

ISBN Information:

DOI: 10.1109/MICRO.2016.7783764

Conference Location: Taipei, Taiwan

Contents

References is not available for this document.

Continuous runahead: Transparent hardware acceleration for memory intensive workloads

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Continuous runahead: Transparent hardware acceleration for memory intensive workloads

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?