Lance: efficient low-precision quantized Winograd convolution for neural networks based on graphics processing units | IEEE Conference Publication | IEEE Xplore