A combined DMA and application-specific prefetching approach for tackling the memory latency bottleneck | IEEE Journals & Magazine | IEEE Xplore