Visually Assisted Self-supervised Audio Speaker Localization and Tracking (IEEE Conference Publication)