Warped-preexecution: A GPU pre-execution approach for improving latency hiding | IEEE Conference Publication | IEEE Xplore