Early and late integration of audio features for automatic video description | IEEE Conference Publication | IEEE Xplore