Loading [a11y]/accessibility-menu.js
Architecturally Compressed CNN: An Embedded Realtime Classifier (NXP Bluebox2.0 with RTMaps) | IEEE Conference Publication | IEEE Xplore

Architecturally Compressed CNN: An Embedded Realtime Classifier (NXP Bluebox2.0 with RTMaps)


Abstract:

The convolution neural networks have revolutionized the computer vision domain. It has proven to be a dominant technology to carry out tasks such as image classification,...Show More

Abstract:

The convolution neural networks have revolutionized the computer vision domain. It has proven to be a dominant technology to carry out tasks such as image classification, semantic segmentation, and object detection. The convolution neural networks surpass the performance of the existing algorithms such as SIFT, HOG, etcetera. Where, instead of manually engineering the features, supervised learning help to learn the essential low-level and high-level features necessary for classifications. The convolution neural networks have become a popular tool to counter computer vision problems. However, it is computationally, and memory intensive to train and deploy the network because of the model size of a deep convolution neural networks. However, the research in the field of design space exploration (DSE) of neural networks and compression techniques to develop compact architectures, have made convolution neural networks memory and computationally efficient. These techniques have also improved the feasibility of convolution neural network for deployment on embedded targets. The paper explores the concept of compact convolution filters to reduce the number of parameters in a convolution neural network. The intuition behind the approach is that replacing convolution filters with a stack of compact convolution filters helps in developing a compact architecture with competitive accuracy. This paper explores the fire module a compact convolution filter and proposes a method of recreating a state-of-the-art architecture VGG-16 using the fire modules to develop a compact architecture, which is further trained on the CIFAR-10 dataset and deployed on a real-time embedded platform known as Bluebox 2.0 by NXP using RTMaps software framework.
Date of Conference: 07-09 January 2019
Date Added to IEEE Xplore: 14 March 2019
ISBN Information:
Conference Location: Las Vegas, NV, USA
References is not available for this document.

I. Introduction

Since the proposition of the convolution neural networks, the field of deep learning has experienced an exponential growth, focussing on various aspects of deep learning such as defining new architectures like RESNET [5], Squeezenet [1], VGG16, VGG19 [2], developing new optimization techniques, various training methodology and implementation of non-linearity like ReLU, ELU, to combat optimization problems. These developments have substantially aided in training a deeper neural network for image recognition or object detection challenge. The increase in the depth of the neural networks has also led to accelerated development of various hardware architectures such as graphics processing unit, tensor processing unit and large-scale distributed deep networks [3], which uses parallel architectures and multiple computation units. The boards such as S32V234, Bluebox 2.0 (embedded systems) by NXP NVIDIAs TITAN, TESLA, GTX 1080(GPUs) and Jetson TK1(embedded systems) are widely being used for deploying and accelerating training process of various deep convolution neural networks.

Select All
1.
Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J. and Keutzer, K., 2016. Squeezenet: Alexnet-level accuracy with 50x fewer parameters and¡ 0.5 mb model size. arXiv preprint arXiv:1602.07360.[ https://arxiv.org/pdf/1602.07360.pdf ]
2.
Simonyan, K. and Zisserman, A., 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556. [ https://arxiv.org/pdf/1409.1556.pdf ]
3.
Dean, J., Corrado, G., Monga, R., Chen, K., Devin, M., Mao, M., Senior, A., Tucker, P., Yang, K., Le, Q.V. and Ng, A.Y., 2012. Large scale distributed deep networks. In Advances in neural information processing systems (pp. 1223–1231 ).
4.
Krizhevsky, A., Sutskever, I. and Hinton, G.E., 2012. Imagenet classifi-cation with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097–1105 ).
5.
Szegedy, C., Ioffe, S., Vanhoucke, V. and Alemi, A.A., 2017, February. Inception-v4, inception-resnet and the impact of residual connections on learning. In AAAI (Vol. 4, p. 12 ).
6.
Lin, M., Chen, Q. and Yan, S., 2013. Network in network. arXiv preprint arXiv:1312.4400.[ https://arxiv.org/pdf/1312.4400.pdf ]
7.
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V. and Rabinovich, A., 2015. Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1–9 ).
8.
Szegedy, C., Ioffe, S., Vanhoucke, V. and Alemi, A.A., 2017, February. Inception-v4, inception-resnet and the impact of residual connections on learning. In AAAI (Vol. 4, p. 12 ).
9.
Realtime Applications with RTMaps and Bluebox 2.0, Sreeram Venki-tachalam, Surya Kollazhi Manghat, Akash Sunil Gaikwad, Niranjan Ravi, Sree Bala Shruthi Bhamidi and Mohamed El-Sharkawy Electrical and Computer Engineering Purdue School of Engineering and Technology, IUPUI, Int’l Conf. Artificial Intelligence — ICAI’18 —
10.
Nxp.com. ( 2018 ). NXP BlueBox Autonomous Driving—NXP. [online] Available at: https://www.nxp.com/products/processors-andmicrocontrollers/arm-based-processors-and-mcus/qoriq-layerscape-armprocessors/nxp-bluebox-autonomous-driving-developmentplatform:BLBX [ Accessed 6 Jun. 2018 ].
11.
Intempora.com. ( 2018 ). Intempora - RTMaps - A component- based framework for rapid development of multi-modal applications. [online] Available at: https://intempora.com/ [ Accessed 6 Jun. 2018 ].
12.
Intempora.com. ( 2018 ). Intempora - RTMaps. [online] Available at: https://intempora.com/products/rtmaps [ Accessed 6 Jun. 2018 ].
13.
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K. and Fei-Fei, L., 2009, June. Imagenet: A large-scale hierarchical image database. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on (pp. 248–255 ).

Contact IEEE to Subscribe

References

References is not available for this document.