Lance: efficient low-precision quantized winograd convolution for neural networks based on graphics processing units  | IEEE Conference Publication | IEEE Xplore