Self-supervised object detection from audio-visual correspondence | IEEE Conference Publication | IEEE Xplore