Online Filter Weakening and Pruning for Efficient Convnets | IEEE Conference Publication | IEEE Xplore

Online Filter Weakening and Pruning for Efficient Convnets


Abstract:

Pruning is an effective method to address the limitation of deploying deep neural networks (DNNs) on embedded systems. Most existing methods prune weights on a given pre-...Show More

Abstract:

Pruning is an effective method to address the limitation of deploying deep neural networks (DNNs) on embedded systems. Most existing methods prune weights on a given pre-trained DNN followed by a costly fine-tuning process. In this paper, we propose a new and efficient pruning algorithm which can prune the structures of filters and filter shapes effectively. This is achieved by defining filter-wise and shape-wise scaling factors to indicate those to be weakened. Then train the network from scratch and multiply weights with corresponding scaling factors. With iterative update, the weights of the selected filters and shapes are gradually weakened to zero and then pruned with little loss to the model capability. We demonstrate the effectiveness of our approach on several CNN models and datasets. For VGG-16 on CIFAR10, we achieve more than 2 x FLOPs reduction and compression with higher accuracy. And for WRN-16-4 on CIFAR100, our method exhibits more than 2 × speedup and compression with less than 1% accuracy drop.
Date of Conference: 23-27 July 2018
Date Added to IEEE Xplore: 11 October 2018
ISBN Information:

ISSN Information:

Conference Location: San Diego, CA, USA

Contact IEEE to Subscribe

References

References is not available for this document.