A Highly Parallel FPGA Implementation of Sparse Neural Network Training | IEEE Conference Publication | IEEE Xplore

A Highly Parallel FPGA Implementation of Sparse Neural Network Training


Abstract:

This paper describes the development of an FPGA implementation of a parallel and reconfigurable architecture for sparse neural networks, capable of on-chip training and i...Show More

Abstract:

This paper describes the development of an FPGA implementation of a parallel and reconfigurable architecture for sparse neural networks, capable of on-chip training and inference. The network connectivity uses pre-determined, structured sparsity to significantly reduce complexity by lowering memory and computational requirements. The architecture uses a notion of edge-processing, leading to efficient pipelining and parallelization. Moreover, the device can be reconfigured to trade off resource utilization with training time to fit networks and datasets of varying sizes. The combined effects of complexity reduction and easy reconfigurability enable greater exploration of network hyperparameters and structures on-chip. As proof of concept, we show implementation results on an Artix-7 FPGA.
Date of Conference: 03-05 December 2018
Date Added to IEEE Xplore: 14 February 2019
ISBN Information:

ISSN Information:

Conference Location: Cancun, Mexico

Contact IEEE to Subscribe

References

References is not available for this document.