Scalable and Fast Inference Serving via Hybrid Communication Scheduling on Heterogeneous Networks | IEEE Conference Publication