Text-Conditional Visual-Language Alignment for Video Captioning | IEEE Journals & Magazine | IEEE Xplore