FPGA-based Low-Batch Training Accelerator for Modern CNNs Featuring High Bandwidth Memory | IEEE Conference Publication | IEEE Xplore