Leveraging Multimodal Knowledge for Spatio-Temporal Action Localization | IEEE Conference Publication | IEEE Xplore