Learning to Localize Sound Source in Visual Scenes | IEEE Conference Publication | IEEE Xplore