Accelerating Deep Learning Using Interconnect-Aware UCX Communication for MPI Collectives | IEEE Journals & Magazine | IEEE Xplore