Multi-Modal Hierarchical Attention-Based Dense Video Captioning | IEEE Conference Publication | IEEE Xplore