Interference-Aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach | IEEE Conference Publication | IEEE Xplore