LOOG: Improving GPU Efficiency With Light-Weight Out-Of-Order Execution | IEEE Journals & Magazine | IEEE Xplore