The memory subsystem in modem embedded programmable architectures executing multimedia applications consumes a significant amount of energy. The designer has to take this fact into consideration, together with the system performance, in order to design devices portable or otherwise. An exploration approach for optimising the power and performance of the data-memory hierarchy as well as the instruction memory in the early system-design phase, is introduced. A power- and performance-efficient data-memory hierarchy is obtained by applying data-reuse transformations in a high-level description of the application, whereas the instruction-memory power optimisation, of the selected optimal data hierarchies of the previous step, is achieved by using a suitably selected cache memory. Furthermore, two cache energy models, namely the high-level power model and the architecture-dependent power model, are introduced. The experimental results, obtained with four well known motion-estimation kernels, provide an insight on the trade-offs among algorithm performance and energy consumption, comparing memory hierarchies with and without an instruction cache for the ARM programmable core. Comparisons results are also provided for choosing an optimal cache memory size
Published in:
Computers and Digital Techniques, IEE Proceedings -
(Volume:149
,
Issue:
4
)
Date of Publication: Jul 2002