Metric Learning with Progressive Self-Distillation for Audio-Visual Embedding Learning | IEEE Conference Publication | IEEE Xplore