AutoInfer: Self-Driving Management for Resource-Efficient, SLO-Aware Machine-Learning Inference in GPU Clusters | IEEE Journals & Magazine | IEEE Xplore