Performance improvement of CUDA applications by reducing CPU-GPU data transfer overhead | IEEE Conference Publication | IEEE Xplore