End-to-End Neural Network Compression via l1/l2 Regularized Latency Surrogates | IEEE Conference Publication | IEEE Xplore