By Topic

Speech and Song Search on the Web: System Design and Implementation

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Yaguchi, Yuichi ; Univ. of Aizu, Aizuwakamatsu ; Watanabe, Y. ; Naruse, K. ; Oka, Ryuichi

This paper proposes a novel search system for speech and song segments. The amount of accumulated video data in the World Wide Web is expanding and its content is varied. Video content includes natural voices and singing voices, and these differ in their phoneme lengths. Our system uses frame-wise phoneme recognition and continuous dynamic programming (CDP). First, each target and query waveform is divided into fixed short-time frames; second, each frame of the waveform is used to estimate a phoneme label using Bayes estimation; third, the query sequences of phoneme labels are searched from target sequences by time-robustness CDP; and, finally, this system gets candidate answers. This method is robust along the time dimension, and thus has a great advantage for natural voice as well as song. This paper also introduces an implementation of this system, which is published on the Web, as a secondary search engine for Youtube data.

Published in:

Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference on

Date of Conference:

16-19 Oct. 2007