MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning | IEEE Journals & Magazine | IEEE Xplore