Generating coherent natural language annotations for video streams | IEEE Conference Publication | IEEE Xplore