Exploring GPU performance, power and energy-efficiency bounds with Cache-aware Roofline Modeling | IEEE Conference Publication | IEEE Xplore