D-STACK: High Throughput DNN Inference by Effective Multiplexing and Spatio-Temporal Scheduling of GPUs | IEEE Journals & Magazine | IEEE Xplore