Optimizing Distributed ML Communication with Fused Computation-Collective Operations | IEEE Conference Publication | IEEE Xplore