Optimization of Compiler-Generated OpenCL CNN Kernels and Runtime for FPGAs | IEEE Conference Publication | IEEE Xplore