CAP: Communication-Aware Automated Parallelization for Deep Learning Inference on CMP Architectures | IEEE Journals & Magazine | IEEE Xplore