Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks | IEEE Conference Publication | IEEE Xplore