A 40 TOPS Single-Chip Accelerator Enabling Low-Latency Inference for Deep Neural Networks | IEEE Journals & Magazine | IEEE Xplore