Variational Student: Learning Compact and Sparser Networks In Knowledge Distillation Framework | IEEE Conference Publication | IEEE Xplore