A Memory-aware Performance Optimization of Tensor Programs for Embedded Devices | IEEE Conference Publication | IEEE Xplore