Self-Supervised Learning of Audio Representations From Audio-Visual Data Using Spatial Alignment | IEEE Journals & Magazine | IEEE Xplore