Learning Feature Semantic Matching for Spatio-Temporal Video Grounding | IEEE Journals & Magazine | IEEE Xplore