Bundling: reducing the overhead of multiprocessor prefetchers | IEEE Conference Publication | IEEE Xplore