Accelerating Allreduce With In-Network Reduction on Intel PIUMA | IEEE Journals & Magazine | IEEE Xplore