Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding | IEEE Conference Publication | IEEE Xplore