Co-Attentional Transformers for Story-Based Video Understanding | IEEE Conference Publication | IEEE Xplore