Transformer Based Text Summary Generation for Videos | IEEE Conference Publication | IEEE Xplore