Loading [MathJax]/extensions/MathZoom.js
ALigN: A Highly Accurate Adaptive Layerwise Log_2_Lead Quantization of Pre-Trained Neural Networks | IEEE Journals & Magazine | IEEE Xplore

ALigN: A Highly Accurate Adaptive Layerwise Log_2_Lead Quantization of Pre-Trained Neural Networks


Average decimal accuracy of the weights of Conv1_2 layer of VGG-16 network using different quantization schemes. ALigN-x_y shows x bits reserved for storing the leading 1...

Abstract:

Deep Neural Networks are one of the machine learning techniques which are increasingly used in a variety of applications. However, the significantly high memory and compu...Show More

Abstract:

Deep Neural Networks are one of the machine learning techniques which are increasingly used in a variety of applications. However, the significantly high memory and computation demands of deep neural networks often limit their deployment on embedded systems. Many recent works have considered this problem by proposing different types of data quantization schemes. However, most of these techniques either require post-quantization retraining of deep neural networks or bear a significant loss in output accuracy. In this paper, we propose a novel and scalable technique with two different modes for the quantization of the parameters of pre-trained neural networks. In the first mode, referred to as log_2_lead, we use a single template for the quantization of all parameters. In the second mode, denoted as ALigN, we analyze the trained parameters of each layer and adaptively adjust the quantization template to achieve even higher accuracy. Our technique significantly maintains the accuracy of the parameters and does not require retraining of the networks. Moreover, it supports quantization to an arbitrary bit-size. For example, compared to the single-precision floating-point numbers-based implementation, our proposed 8-bit quantization technique generates only ~ 0.2% and ~ 0.1%, loss in the Top-1 and Top-5 accuracies respectively for VGG-16 network using ImageNet dataset. We have observed similar minimal losses in the Top-1 and Top-5 accuracies for AlexNet and Resnet-18 using the proposed quantization scheme for the 8-bit range. Our proposed quantization technique also provides a higher mean intersection over union for semantic segmentation when compared with state-of-the-art quantization techniques. The proposed technique represents parameters in powers of 2, thereby eliminating the need for resource-computationally intensive multiplier units for the hardware accelerators of the neural networks. We also present a design for implementing the multiplication operation using bi...
Average decimal accuracy of the weights of Conv1_2 layer of VGG-16 network using different quantization schemes. ALigN-x_y shows x bits reserved for storing the leading 1...
Published in: IEEE Access ( Volume: 8)
Page(s): 118899 - 118911
Date of Publication: 26 June 2020
Electronic ISSN: 2169-3536

Funding Agency:

Author image of Siddharth Gupta
Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, India
Siddharth Gupta received the B.Tech. degree in computer science and engineering from G. L. B. I. T. M., Greater Noida, India, in 2015. He is currently pursuing the M.S. degree in computer science and engineering from IIT Indore, Indore, India. His current research interests include deep learning, approximation techniques in machine learning, and hardware realization of DNNs.
Siddharth Gupta received the B.Tech. degree in computer science and engineering from G. L. B. I. T. M., Greater Noida, India, in 2015. He is currently pursuing the M.S. degree in computer science and engineering from IIT Indore, Indore, India. His current research interests include deep learning, approximation techniques in machine learning, and hardware realization of DNNs.View more
Author image of Salim Ullah
Department of Computer Science, Technische Universität Dresden, Dresden, Germany
Salim Ullah received the B.Sc. and M.Sc. degrees in computer systems engineering from the University of Engineering and Technology at Peshawar, Pakistan. He is currently pursuing the Ph.D. degree with the Chair of Processor Design, Technische Universität Dresden. His current research interests include the design of approximate arithmetic units, approximate caches, and hardware accelerators for deep neural networks.
Salim Ullah received the B.Sc. and M.Sc. degrees in computer systems engineering from the University of Engineering and Technology at Peshawar, Pakistan. He is currently pursuing the Ph.D. degree with the Chair of Processor Design, Technische Universität Dresden. His current research interests include the design of approximate arithmetic units, approximate caches, and hardware accelerators for deep neural networks.View more
Author image of Kapil Ahuja
Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, India
Kapil Ahuja received the B.Tech. degree from IIT (BHU), India, and the M.S. and Ph.D. degrees from Virginia Tech, USA. He was a Postdoctoral Research Fellow with the Max Planck Institute, Germany. He is currently an Associate Professor in Computer Science and Engineering at IIT Indore, India. In the past, he has been a Visiting Professor with TU Braunschweig, Germany, TU Dresden, Germany, and Sandia National Labs, USA. He...Show More
Kapil Ahuja received the B.Tech. degree from IIT (BHU), India, and the M.S. and Ph.D. degrees from Virginia Tech, USA. He was a Postdoctoral Research Fellow with the Max Planck Institute, Germany. He is currently an Associate Professor in Computer Science and Engineering at IIT Indore, India. In the past, he has been a Visiting Professor with TU Braunschweig, Germany, TU Dresden, Germany, and Sandia National Labs, USA. He...View more
Author image of Aruna Tiwari
Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, India
Aruna Tiwari (Member IEEE) received the B.E./M.E. degrees in computer science and engineering from SGSITS Indore, India, and the Ph.D. degree in computer science and engineering from RGPV Bhopal, India. She has a background in computer science and engineering. She has more than 20 years of teaching and research experience. She is with IIT Indore, India, since 2012. She is currently an Associate Professor of computer scien...Show More
Aruna Tiwari (Member IEEE) received the B.E./M.E. degrees in computer science and engineering from SGSITS Indore, India, and the Ph.D. degree in computer science and engineering from RGPV Bhopal, India. She has a background in computer science and engineering. She has more than 20 years of teaching and research experience. She is with IIT Indore, India, since 2012. She is currently an Associate Professor of computer scien...View more
Author image of Akash Kumar
Department of Computer Science, Technische Universität Dresden, Dresden, Germany
Akash Kumar (Senior Member, IEEE) received the joint Ph.D. degree in electrical engineering and embedded systems from the Eindhoven University of Technology, Eindhoven, The Netherlands, and the National University of Singapore (NUS), Singapore, in 2009. From 2009 to 2015, he was with NUS. He is currently a Professor with Technische Universität Dresden, Dresden, Germany, where he is directing the chair of processor design....Show More
Akash Kumar (Senior Member, IEEE) received the joint Ph.D. degree in electrical engineering and embedded systems from the Eindhoven University of Technology, Eindhoven, The Netherlands, and the National University of Singapore (NUS), Singapore, in 2009. From 2009 to 2015, he was with NUS. He is currently a Professor with Technische Universität Dresden, Dresden, Germany, where he is directing the chair of processor design....View more

Author image of Siddharth Gupta
Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, India
Siddharth Gupta received the B.Tech. degree in computer science and engineering from G. L. B. I. T. M., Greater Noida, India, in 2015. He is currently pursuing the M.S. degree in computer science and engineering from IIT Indore, Indore, India. His current research interests include deep learning, approximation techniques in machine learning, and hardware realization of DNNs.
Siddharth Gupta received the B.Tech. degree in computer science and engineering from G. L. B. I. T. M., Greater Noida, India, in 2015. He is currently pursuing the M.S. degree in computer science and engineering from IIT Indore, Indore, India. His current research interests include deep learning, approximation techniques in machine learning, and hardware realization of DNNs.View more
Author image of Salim Ullah
Department of Computer Science, Technische Universität Dresden, Dresden, Germany
Salim Ullah received the B.Sc. and M.Sc. degrees in computer systems engineering from the University of Engineering and Technology at Peshawar, Pakistan. He is currently pursuing the Ph.D. degree with the Chair of Processor Design, Technische Universität Dresden. His current research interests include the design of approximate arithmetic units, approximate caches, and hardware accelerators for deep neural networks.
Salim Ullah received the B.Sc. and M.Sc. degrees in computer systems engineering from the University of Engineering and Technology at Peshawar, Pakistan. He is currently pursuing the Ph.D. degree with the Chair of Processor Design, Technische Universität Dresden. His current research interests include the design of approximate arithmetic units, approximate caches, and hardware accelerators for deep neural networks.View more
Author image of Kapil Ahuja
Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, India
Kapil Ahuja received the B.Tech. degree from IIT (BHU), India, and the M.S. and Ph.D. degrees from Virginia Tech, USA. He was a Postdoctoral Research Fellow with the Max Planck Institute, Germany. He is currently an Associate Professor in Computer Science and Engineering at IIT Indore, India. In the past, he has been a Visiting Professor with TU Braunschweig, Germany, TU Dresden, Germany, and Sandia National Labs, USA. He has a varied background, including degrees in computer science, mathematics, and mechanical engineering. He is working on the mathematics of data science and computational science, specifically machine learning, numerical linear algebra, and optimization.
Kapil Ahuja received the B.Tech. degree from IIT (BHU), India, and the M.S. and Ph.D. degrees from Virginia Tech, USA. He was a Postdoctoral Research Fellow with the Max Planck Institute, Germany. He is currently an Associate Professor in Computer Science and Engineering at IIT Indore, India. In the past, he has been a Visiting Professor with TU Braunschweig, Germany, TU Dresden, Germany, and Sandia National Labs, USA. He has a varied background, including degrees in computer science, mathematics, and mechanical engineering. He is working on the mathematics of data science and computational science, specifically machine learning, numerical linear algebra, and optimization.View more
Author image of Aruna Tiwari
Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, India
Aruna Tiwari (Member IEEE) received the B.E./M.E. degrees in computer science and engineering from SGSITS Indore, India, and the Ph.D. degree in computer science and engineering from RGPV Bhopal, India. She has a background in computer science and engineering. She has more than 20 years of teaching and research experience. She is with IIT Indore, India, since 2012. She is currently an Associate Professor of computer science and engineering. Her research interests include soft-computing learning algorithms, especially with neural networks, fuzzy clustering, and evolutionary computation for different problems for handling big data mainly for disease diagnosis and genomics.
Aruna Tiwari (Member IEEE) received the B.E./M.E. degrees in computer science and engineering from SGSITS Indore, India, and the Ph.D. degree in computer science and engineering from RGPV Bhopal, India. She has a background in computer science and engineering. She has more than 20 years of teaching and research experience. She is with IIT Indore, India, since 2012. She is currently an Associate Professor of computer science and engineering. Her research interests include soft-computing learning algorithms, especially with neural networks, fuzzy clustering, and evolutionary computation for different problems for handling big data mainly for disease diagnosis and genomics.View more
Author image of Akash Kumar
Department of Computer Science, Technische Universität Dresden, Dresden, Germany
Akash Kumar (Senior Member, IEEE) received the joint Ph.D. degree in electrical engineering and embedded systems from the Eindhoven University of Technology, Eindhoven, The Netherlands, and the National University of Singapore (NUS), Singapore, in 2009. From 2009 to 2015, he was with NUS. He is currently a Professor with Technische Universität Dresden, Dresden, Germany, where he is directing the chair of processor design. His current research interests include the design, analysis, and resource management of low-power and fault-tolerant embedded multiprocessor systems.
Akash Kumar (Senior Member, IEEE) received the joint Ph.D. degree in electrical engineering and embedded systems from the Eindhoven University of Technology, Eindhoven, The Netherlands, and the National University of Singapore (NUS), Singapore, in 2009. From 2009 to 2015, he was with NUS. He is currently a Professor with Technische Universität Dresden, Dresden, Germany, where he is directing the chair of processor design. His current research interests include the design, analysis, and resource management of low-power and fault-tolerant embedded multiprocessor systems.View more

References

References is not available for this document.