Compile-time partitioning of iterative parallel loops to reduce cache coherency traffic | IEEE Journals & Magazine | IEEE Xplore