High Throughput and Low Bandwidth Demand: Accelerating CNN Inference Block-by-block on FPGAs | IEEE Conference Publication | IEEE Xplore