Effective multi-GPU communication using multiple CUDA streams and threads | IEEE Conference Publication | IEEE Xplore