In this paper we describe a model for multimodal topic segmentation and classification of Dutch news video. The research focuses on the interaction between three modalities (visual, auditory and textual information) in an integrated model for video analysis. We present a fully automated sequential feedback model in which linguistic analysis is combined with visual information for both segmentation and classification.
Published in: Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on (Volume: 2)
Date of Conference: 2002