Bandwidth intensive 3-D FFT kernel for GPUs using CUDA | IEEE Conference Publication | IEEE Xplore