Achieving maximum performance for matrix multiplication using set associative cache | IEEE Conference Publication | IEEE Xplore