Performance upper bound analysis and optimization of SGEMM on Fermi and Kepler GPUs | IEEE Conference Publication | IEEE Xplore