Leveraging the Video-Level Semantic Consistency of Event for Audio-Visual Event Localization | IEEE Journals & Magazine | IEEE Xplore