Ristretto: A Framework for Empirical Study of Resource-Efficient Inference in Convolutional Neural Networks | IEEE Journals & Magazine | IEEE Xplore

Ristretto: A Framework for Empirical Study of Resource-Efficient Inference in Convolutional Neural Networks


Abstract:

Convolutional neural networks (CNNs) have led to remarkable progress in a number of key pattern recognition tasks, such as visual scene understanding and speech recogniti...Show More

Abstract:

Convolutional neural networks (CNNs) have led to remarkable progress in a number of key pattern recognition tasks, such as visual scene understanding and speech recognition, that potentially enable numerous applications. Consequently, there is a significant need to deploy trained CNNs to resource-constrained embedded systems. Inference using pretrained modern deep CNNs, however, requires significant system resources, including computation, energy, and memory space. To enable efficient implementation of trained CNNs, a viable approach is to approximate the network with an implementation-friendly model with only negligible degradation in classification accuracy. We present Ristretto, a CNN approximation framework that enables empirical investigation of the tradeoff between various number representation and word width choices and the classification accuracy of the model. Specifically, Ristretto analyzes a given CNN with respect to numerical range required to represent weights, activations, and intermediate results of convolutional and fully connected layers, and subsequently, it simulates the impact of reduced word width or lower precision arithmetic operators on the model accuracy. Moreover, Ristretto can fine-tune a quantized network to further improve its classification accuracy under a given number representation and word width configuration. Given a maximum classification accuracy degradation tolerance of 1%, we use Ristretto to demonstrate that three ImageNet networks can be condensed to use 8-bit dynamic fixed point for network weights and activations. Ristretto is available as a popular open-source software project1 and has already been viewed over 1000 times on Github as of the submission of this brief.
Published in: IEEE Transactions on Neural Networks and Learning Systems ( Volume: 29, Issue: 11, November 2018)
Page(s): 5784 - 5789
Date of Publication: 16 March 2018

ISSN Information:

PubMed ID: 29993820

Funding Agency:

Laboratory for Embedded and Programmable Systems, University of California at Davis, Davis, CA, USA
VLSI Computation Laboratory, University of California at Davis, Davis, CA, USA
Laboratory for Embedded and Programmable Systems, University of California at Davis, Davis, CA, USA
Laboratory for Embedded and Programmable Systems, University of California at Davis, Davis, CA, USA

Laboratory for Embedded and Programmable Systems, University of California at Davis, Davis, CA, USA
VLSI Computation Laboratory, University of California at Davis, Davis, CA, USA
Laboratory for Embedded and Programmable Systems, University of California at Davis, Davis, CA, USA
Laboratory for Embedded and Programmable Systems, University of California at Davis, Davis, CA, USA
Contact IEEE to Subscribe

References

References is not available for this document.