As processor performance continues to improve, more emphasis must be placed on the performance of the memory system. In this paper, a detailed characterization of data cache behavior for individual load instructions is given. We show that by selectively applying cache line allocation according to the characteristics of individual load instructions, overall performance can be improved for both the data cache and the memory system. This approach can improve some aspects of memory performance by as much as 60 percent on existing executables.
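To make the idea of per-load selective allocation concrete, the following is a minimal sketch and not the paper's actual mechanism, which the abstract does not specify. It assumes a small direct-mapped cache, a synthetic trace of (load PC, address) pairs, and a hypothetical policy in which loads whose profiled hit rate falls below an arbitrary threshold do not allocate a cache line on a miss. The names `simulate`, `pick_no_allocate_pcs`, and `build_trace`, the cache geometry, and the threshold are all illustrative assumptions.

```python
from collections import defaultdict

LINE_SIZE = 32   # bytes per cache line (assumed)
NUM_LINES = 64   # lines in the direct-mapped cache (assumed)


def simulate(trace, no_allocate_pcs=frozenset()):
    """Run a trace of (pc, addr) loads through a direct-mapped cache.

    Loads whose PC is in no_allocate_pcs never allocate a line on a miss.
    Returns (total_hits, per_pc) where per_pc maps pc -> [hits, accesses].
    """
    tags = [None] * NUM_LINES
    total_hits = 0
    per_pc = defaultdict(lambda: [0, 0])
    for pc, addr in trace:
        line = addr // LINE_SIZE
        index = line % NUM_LINES
        tag = line // NUM_LINES
        per_pc[pc][1] += 1
        if tags[index] == tag:
            total_hits += 1
            per_pc[pc][0] += 1
        elif pc not in no_allocate_pcs:
            tags[index] = tag  # allocate on miss only for permitted loads
    return total_hits, per_pc


def pick_no_allocate_pcs(per_pc, threshold=0.1):
    """Hypothetical policy: loads with a profiled hit rate below threshold."""
    return frozenset(
        pc for pc, (hits, accesses) in per_pc.items()
        if accesses and hits / accesses < threshold
    )


def build_trace(iterations=1000):
    """Synthetic trace: one load reuses 8 lines, another streams without reuse."""
    trace = []
    for i in range(iterations):
        trace.append((0x100, (i % 8) * LINE_SIZE))      # load with reuse
        trace.append((0x200, 0x10000 + i * LINE_SIZE))  # streaming load
    return trace


def run_demo():
    trace = build_trace()
    baseline_hits, profile = simulate(trace)             # profiling pass
    cold_pcs = pick_no_allocate_pcs(profile)             # classify loads
    selective_hits, _ = simulate(trace, cold_pcs)        # selective allocation
    return baseline_hits, selective_hits, cold_pcs


if __name__ == "__main__":
    baseline, selective, cold = run_demo()
    print("no-allocate PCs:", [hex(pc) for pc in sorted(cold)])
    print("baseline hits:", baseline, "selective hits:", selective)
```

In this synthetic trace the streaming load never reuses a line, so allocating for it only evicts lines that the other load would have hit. Real workloads are far less clear-cut, and the numbers this sketch prints say nothing about the results reported in the paper.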
Published in:
Proceedings of the 28th Annual International Symposium on Microarchitecture, 1995
Date of Conference: 29 Nov-1 Dec 1995