By Topic

Reusing Speech Techniques for Video Semantic Indexing [Applications Corner]

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Shinoda, K. ; Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo, Japan ; Inoue, N.

Many techniques developed in speech research have been successfully employed in other fields, such as automatic video semantic indexing. In this application, a user submits a textual input query for an desired object or a scene to a search system, which returns video shots that include the object or scene. Recently, a new method using Gaussian-mixture model (GMM) supervectors and support vector machines (SVMs) was proven to be very effective. In this method, speech technology such as speaker verification and adaptation techniques play very important roles.

Published in:

Signal Processing Magazine, IEEE  (Volume:30 ,  Issue: 2 )