What Makes the Sound?: A Dual-Modality Interacting Network for Audio-Visual Event Localization | IEEE Conference Publication