Custom instructions with local memory elements without expensive DMA transfers | IEEE Conference Publication | IEEE Xplore