Evaluating and mitigating bandwidth bottlenecks across the memory hierarchy in GPUs | IEEE Conference Publication | IEEE Xplore