cCUDA: Effective Co-Scheduling of Concurrent Kernels on GPUs | IEEE Journals & Magazine | IEEE Xplore