Optimizing parallel multiplication operation for rectangular and transposed matrices | IEEE Conference Publication | IEEE Xplore