Optimizing DNN Compilation for Distributed Training With Joint OP and Tensor Fusion | IEEE Journals & Magazine | IEEE Xplore