Abstract:
Efficiently using the hardware capabilities of the Cell processor, a heterogeneous chip multiprocessor that uses several levels of parallelism to deliver high performance...Show MoreMetadata
Abstract:
Efficiently using the hardware capabilities of the Cell processor, a heterogeneous chip multiprocessor that uses several levels of parallelism to deliver high performance, and being able to reuse legacy code are real challenges for application developers. We propose to use Generative Programming and more precisely template meta-programming to design an domain specific embedded language using algorithmic skeletons to generate applications based on a high-level mapping description. The method is easy to use by developers and delivers performance close to the performance of optimized hand-written code, as shown on various benchmarks ranging from simple BLAS kernels to image processing applications.
Published in: 2009 18th International Conference on Parallel Architectures and Compilation Techniques
Date of Conference: 12-16 September 2009
Date Added to IEEE Xplore: 25 September 2009
Print ISBN:978-0-7695-3771-9
Print ISSN: 1089-795X