Opara: Exploiting Operator Parallelism for Expediting DNN Inference on GPUs | IEEE Journals & Magazine | IEEE Xplore