Comparative benchmarking: matrix multiplication on a multicore coprocessor and a GPU | IEEE Conference Publication | IEEE Xplore