TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs


Abstract:

This paper introduces a novel software technique to optimize thread allocation for merged and fused kernels in multi-tenant inference systems on embedded Graphics Processing Units (GPUs). Embedded systems equipped with GPUs face challenges in managing diverse deep learning workloads while adhering to Quality-of-Service (QoS) standards, primarily due to limited hardware resources and the varied nature of deep learning models. Prior work has relied on static thread allocation strategies, often leading to suboptimal hardware utilization. To address these challenges, we propose a new software technique called TLP Balancer. TLP Balancer automatically identifies the best-performing number of threads based on performance modeling. This approach significantly enhances hardware utilization and ensures QoS compliance, outperforming traditional fixed-thread allocation methods. Our evaluation shows that TLP Balancer improves throughput by 40% compared to the state-of-the-art automated kernel merge and fusion techniques.
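The core idea — choosing the best-performing thread count from a performance model subject to QoS constraints, rather than using a fixed allocation — can be illustrated with a minimal sketch. The model functions below (`modeled_throughput`, `modeled_latency`) are hypothetical toy stand-ins, not the paper's actual performance model:

```python
# Illustrative sketch only: select the thread count that maximizes modeled
# throughput while still meeting a QoS latency budget. The two model
# functions are toy assumptions, not TLP Balancer's real model.

def modeled_throughput(threads, peak=2048):
    # Toy model: throughput grows with thread-level parallelism,
    # then saturates as hardware resources are exhausted.
    return min(threads, peak) / (1.0 + threads / (4.0 * peak))

def modeled_latency(threads, work=1e6):
    # Toy model: more threads per merged/fused kernel shortens
    # per-request latency.
    return work / max(threads, 1)

def best_thread_count(candidates, qos_latency_budget):
    # Keep only allocations that satisfy the QoS latency constraint,
    # then pick the one with the highest modeled throughput.
    feasible = [t for t in candidates if modeled_latency(t) <= qos_latency_budget]
    if not feasible:
        return None
    return max(feasible, key=modeled_throughput)

candidates = [128, 256, 512, 1024, 2048]
print(best_thread_count(candidates, qos_latency_budget=2000))
```

A fixed-thread baseline would skip the search and always return the same candidate; the gain reported in the paper comes from performing this selection automatically per workload.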
Published in: IEEE Embedded Systems Letters ( Early Access )
Page(s): 1 - 1
Date of Publication: 14 November 2024
