Visual Commonsense-Aware Representation Network for Video Captioning | IEEE Journals & Magazine | IEEE Xplore