Evaluation of Automatically Generated Video Captions Using Vision and Language Models | IEEE Conference Publication | IEEE Xplore