By Topic

Application of vector quantized hidden Markov modeling to telephone network based connected digit recognition

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

5 Author(s)
Buhrke, E.R. ; AT&T Bell Labs., Murray Hill, NJ, USA ; Cardin, R. ; Normandin, Y. ; Rahim, M.
more authors

Connected digit speech recognition in the telephone network is becoming increasingly more important as the demand for speech technology becomes widespread. In the past few years, several highly successful techniques for recognizing spoken connected digit strings have been proposed. Although these techniques have been applied to non-telephone based speech [e.g. Texas Instruments database], they have produced high recognition performance. Further, similar levels of performances have been demonstrated using discrete density and continuous density based hidden Markov models (HMMs). The success of the vector quantized (VQ) modeling approach, in particular, is encouraging and rather important from the viewpoint of computational efficiency. This paper presents a study of connected digit recognition on telephone network based data using VQ HMMs. We investigate several factors affecting the error rate of VQ HMMs-such as maximum mutual information (MMI) training, sender modeling, and codebook size-and measure their contributions to recognition accuracy. The model architecture, number of states and transitions, is also optimized and its contribution to overall performance discussed

Published in:

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on  (Volume:i )

Date of Conference:

19-22 Apr 1994