GRID: Gradient Routing With In-Network Aggregation for Distributed Training | IEEE Journals & Magazine | IEEE Xplore