Enhancing Multimodal Alignment with Momentum Augmentation for Dense Video Captioning | IEEE Conference Publication | IEEE Xplore