Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution | IEEE Conference Publication | IEEE Xplore