An efficient and flexible inference system for serving heterogeneous ensembles of deep neural networks | IEEE Conference Publication | IEEE Xplore