HAND: A Hybrid Approach to Accelerate Non-contiguous Data Movement Using MPI Datatypes on GPU Clusters | IEEE Conference Publication | IEEE Xplore