Efficient Deep Learning Inference Based on Model Compression | IEEE Conference Publication | IEEE Xplore