Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition

Author(s):
Hirose, K. (Sch. of Frontier Sci., Tokyo Univ., Japan); Iwano, K.

We have been developing a reliable method of prosodic word boundary detection for Japanese continuous speech, based on statistical modeling of the mora transitions of the fundamental frequency contours of prosodic words. Modifications to the codebook sizes and the HMM topologies improved the boundary detection performance. When using mora boundary information obtainable from the phoneme recognition process, the detection rate reached around 73%, with 12.5% insertion errors, in speaker-open experiments. This method was then integrated into a continuous speech recognition system with unlimited vocabulary. The integrated system conducts the recognition process in two stages: the first stage detects mora boundaries without prosodic information, and the second stage increases the mora recognition rate using prosodic word boundary information. Slight improvements in mora recognition rates were observed in both speaker-closed and speaker-open experiments.
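
The abstract reports a detection rate and an insertion error rate without defining them. As a rough illustration only, the sketch below scores detected boundaries against reference boundaries under assumed definitions: boundaries are integer mora indices, the detection rate is the fraction of reference boundaries matched by a detection, and the insertion rate is the number of unmatched detections divided by the number of reference boundaries. The function name `score_boundaries`, the `tolerance` parameter, and the sample data are hypothetical and are not taken from the paper, whose exact scoring procedure may differ.

```python
def score_boundaries(reference, detected, tolerance=0):
    """Return (detection_rate, insertion_rate) for boundary positions.

    Assumed definitions (not from the paper):
      detection_rate = matched reference boundaries / reference boundaries
      insertion_rate = unmatched detections / reference boundaries
    """
    total = len(reference)
    if total == 0:
        raise ValueError("reference must contain at least one boundary")

    unmatched = sorted(detected)
    hits = 0
    for ref in sorted(reference):
        # Pair each reference boundary with the closest unused detection
        # that lies within the allowed tolerance (in mora units).
        candidates = [d for d in unmatched if abs(d - ref) <= tolerance]
        if candidates:
            best = min(candidates, key=lambda d: abs(d - ref))
            unmatched.remove(best)
            hits += 1

    return hits / total, len(unmatched) / total


# Made-up example data: boundary positions given as mora indices.
reference = [3, 7, 12, 16]
detected = [3, 8, 12, 14, 16]

det, ins = score_boundaries(reference, detected)
print(f"detection rate: {det:.2%}, insertion rate: {ins:.2%}")
```

With `tolerance=0` only exact matches count; passing `tolerance=1` would also accept a detection one mora away from the reference position.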

Published in:

Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on (Volume: 3)

Date of Conference:

2000