Automatic video annotation through search and mining
Moxley, E.
Tao Mei
Xian-Sheng Hua
Wei-Ying Ma
Manjunath, B.S.
Vision Res. Lab., Univ. of California, Santa Barbara, CA
This paper appears in: Multimedia and Expo, 2008 IEEE International Conference on
Publication Date: June 23-26, 2008
On page(s): 685-688
Location: Hannover
ISBN: 978-1-4244-2570-9
INSPEC Accession Number: 10178675
Digital Object Identifier: 10.1109/ICME.2008.4607527
Current Version Published: 2008-08-26
Abstract
Conventional approaches to video annotation predominantly focus on supervised identification of a limited set of concepts, while unsupervised annotation with an infinite vocabulary remains unexplored. This work exploits the overlap in content among news videos to annotate a video automatically by mining similar videos that reinforce, filter, and improve its original annotations. The algorithm employs a two-step process of search followed by mining. Given a query video consisting of visual content and speech-recognized transcripts, similar videos are first ranked in a multimodal search. Then, the transcripts associated with these similar videos are mined to extract keywords for the query. We conducted extensive experiments over the TRECVID 2005 corpus and showed that the proposed approach outperforms applying only the mining process to the original video. This work represents the first attempt at unsupervised automatic video annotation leveraging overlapping video content.
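The two-step process described in the abstract (multimodal search, then transcript mining) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the similarity measures, fusion rule, or keyword scoring, so everything below is an assumption chosen for brevity. Specifically, the cosine similarities, the linear fusion weight `alpha`, the score-weighted term voting, the tiny stopword list, and all function and field names (`search`, `mine_keywords`, `visual`, `transcript`) are hypothetical.

```python
import math
from collections import Counter

# Hypothetical, deliberately tiny stopword list (assumption, not from the paper).
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "is", "for"}

def tokenize(text):
    # Lowercase, split on whitespace, keep purely alphabetic non-stopword tokens.
    return [w for w in text.lower().split() if w.isalpha() and w not in STOPWORDS]

def cosine(a, b):
    # Cosine similarity of two equal-length numeric vectors; 0.0 if either is all zeros.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def text_similarity(t1, t2):
    # Cosine similarity between bag-of-words term counts of two transcripts.
    c1, c2 = Counter(tokenize(t1)), Counter(tokenize(t2))
    vocab = sorted(set(c1) | set(c2))
    return cosine([c1[w] for w in vocab], [c2[w] for w in vocab])

def search(query, corpus, alpha=0.5, top_k=2):
    # Step 1 (assumed form): rank corpus videos by a linear fusion of
    # visual and transcript similarity to the query, and keep the top_k.
    scored = []
    for video in corpus:
        visual = cosine(query["visual"], video["visual"])
        textual = text_similarity(query["transcript"], video["transcript"])
        scored.append((alpha * visual + (1 - alpha) * textual, video))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]

def mine_keywords(ranked, n=3):
    # Step 2 (assumed form): each retrieved video votes for the terms in its
    # transcript, weighted by its search score; return the n heaviest terms.
    weights = Counter()
    for score, video in ranked:
        for term in set(tokenize(video["transcript"])):
            weights[term] += score
    return [term for term, _ in weights.most_common(n)]

# Toy data, invented purely for illustration.
query = {"visual": [1.0, 0.0], "transcript": "flood rescue in city"}
corpus = [
    {"id": "a", "visual": [0.9, 0.1], "transcript": "flood waters rise rescue teams deployed"},
    {"id": "b", "visual": [0.8, 0.2], "transcript": "rescue boats reach flood victims"},
    {"id": "c", "visual": [0.0, 1.0], "transcript": "stock market closes higher"},
]
ranked = search(query, corpus)
keywords = mine_keywords(ranked, n=2)
print([video["id"] for _, video in ranked], keywords)
```

In this toy setup, terms shared by several retrieved transcripts accumulate more weight than terms that appear in only one, which is one simple way that similar videos could reinforce or filter candidate annotations for the query.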