A 29.12-TOPS/W Vector Systolic Accelerator With NAS-Optimized DNNs in 28-nm CMOS | IEEE Journals & Magazine | IEEE Xplore