Generic Matrix Multiplication for Multi-GPU Accelerated Distributed-Memory Platforms over PaRSEC | IEEE Conference Publication | IEEE Xplore