Abstract:
Always-on subsystems in mobile/Internet of Things (IoT) SoCs process a variety of real-time sensor data deep neural network (DNN) classification workloads in a heavily co...Show MoreMetadata
Abstract:
Always-on subsystems in mobile/Internet of Things (IoT) SoCs process a variety of real-time sensor data deep neural network (DNN) classification workloads in a heavily constrained energy budget. This can be achieved with robust, low-voltage circuits, and specialized hardware accelerators. We present a 16-nm always-on DNN processor, which consists primarily of a microcontroller and a DNN accelerator with on-chip SRAM for the model weights. The design operates robustly from 0.4 to 1-V, with calibration-free automatic voltage/frequency tuning provided by tracking small non-zero razor timing error rates. A novel timing error-driven synchronization-free adaptive clocking scheme significantly reduces the adaptation latency to provide resilience to fast on-chip supply noise and reduce margins. To accommodate the tight energy constraints of always-on IoT workloads, we implement a multi-cycle SRAM read scheme that allows the memory voltage to scale at iso-throughput, improving energy efficiency across the entire operating range. The wide operating range allows for high performance at 1.36 GHz, low-power consumption downs to 750 μW, and stateof-the-art raw efficiency at 16-bit precision of 750 GOPS/W dense or 1.81 TOPS/W sparse.
Published in: IEEE Journal of Solid-State Circuits ( Volume: 54, Issue: 7, July 2019)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Random access memory ,
- Timing ,
- Hardware ,
- Clocks ,
- Pipelines ,
- Latches ,
- System-on-chip
- Index Terms
- Adaptive Clocking ,
- Neural Network ,
- Deep Neural Network ,
- Energy Efficiency ,
- Time Error ,
- Energy Budget ,
- Adaptive Scheme ,
- Hardware Accelerators ,
- Convolutional Neural Network ,
- Noise Sources ,
- Supply Voltage ,
- Spiking Neural Networks ,
- Clock Frequency ,
- Clock Cycles ,
- Human Activity Recognition ,
- Voltage Sag ,
- Datapath ,
- Varactor ,
- Voltage Noise ,
- Memory Bandwidth ,
- On-chip Memory ,
- Voltage Scaling ,
- Detection Circuit ,
- Test Chip ,
- Critical Endpoint ,
- Clock Period ,
- Guard Band ,
- Aggression Scale ,
- Inverter
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Random access memory ,
- Timing ,
- Hardware ,
- Clocks ,
- Pipelines ,
- Latches ,
- System-on-chip
- Index Terms
- Adaptive Clocking ,
- Neural Network ,
- Deep Neural Network ,
- Energy Efficiency ,
- Time Error ,
- Energy Budget ,
- Adaptive Scheme ,
- Hardware Accelerators ,
- Convolutional Neural Network ,
- Noise Sources ,
- Supply Voltage ,
- Spiking Neural Networks ,
- Clock Frequency ,
- Clock Cycles ,
- Human Activity Recognition ,
- Voltage Sag ,
- Datapath ,
- Varactor ,
- Voltage Noise ,
- Memory Bandwidth ,
- On-chip Memory ,
- Voltage Scaling ,
- Detection Circuit ,
- Test Chip ,
- Critical Endpoint ,
- Clock Period ,
- Guard Band ,
- Aggression Scale ,
- Inverter
- Author Keywords