Multimodal Video Summarization using Attention based Transformers (MVSAT) | IEEE Conference Publication | IEEE Xplore