Work-in-Progress: Flexible Group-Level Pruning of Deep Neural Networks for Fast Inference on Mobile GPUs | IEEE Conference Publication | IEEE Xplore