Text Description Generation from Videos via Deep Semantic Models | IEEE Conference Publication | IEEE Xplore