CUDA Grid-Level Task Progression Algorithms | IEEE Conference Publication | IEEE Xplore