Scalable Heterogeneous Scheduling Based Model Parallelism for Real-Time Inference of Large-Scale Deep Neural Networks | IEEE Journals & Magazine | IEEE Xplore