Performance on SIMD architectures of auto-tuned programs for matrix multiplication | IEEE Conference Publication | IEEE Xplore