Enabling Efficient Fast Convolution Algorithms on GPUs via MegaKernels | IEEE Journals & Magazine | IEEE Xplore