Design and Implementation of Kernel-based MPI Reduction Operations for Intel GPU s | IEEE Conference Publication | IEEE Xplore