BiRD: Bi-Directional Input Reuse Dataflow for Enhancing Depthwise Convolution Performance on Systolic Arrays | IEEE Journals & Magazine | IEEE Xplore