Learn-to-Scale: Parallelizing Deep Learning Inference on Chip Multiprocessor Architecture | IEEE Conference Publication | IEEE Xplore