Latency-Aware Pruning of Neural Networks via Structure Search with Integer Programming | IEEE Conference Publication | IEEE Xplore