Serving DNN Inference With Fine-Grained Spatio-Temporal Sharing of GPU Servers | IEEE Journals & Magazine | IEEE Xplore