Dense Video Captioning With Early Linguistic Information Fusion | IEEE Journals & Magazine | IEEE Xplore