Jily: Cost-Aware AutoScaling of Heterogeneous GPU for DNN Inference in Public Cloud | IEEE Conference Publication | IEEE Xplore