Latency-Based Inter-Operator Scheduling for CNN Inference Acceleration on GPU | IEEE Journals & Magazine | IEEE Xplore