Evaluation of various parameter sets in spoken digits recognition

3 Author(s)
Ichikawa, A.; Hitachi Limited, Tokyo, Japan; Nakano, Y.; Nakata, K.

Various parameter sets, including a spectrum envelope, cepstrum, autocorrelation function, linear predictive coefficients, and partial autocorrelation coefficients (PACs), are evaluated experimentally to determine which is the best parameter for spoken digit recognition. The recognition principle is simple pattern matching in the parameter space with nonlinear adjustment of the time axis. The spectrum envelope and cepstrum attain the best recognition score of 100 percent for ten spoken digits from a single male speaker. PACs seem preferable because of their ease of extraction and theoretical orthogonality; however, they tend to suffer from computation errors when computed in fixed-point arithmetic with a short accumulator length. We find two effective means of reducing these errors: one is variable use of the PAC dimensions controlled by computation accuracy, and the other is smoothing along the time axis. With these improvements the PACs offer almost 100 percent recognition.
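The abstract does not say how the partial autocorrelation coefficients were extracted or which time-axis adjustment rule was used, so the following is only a minimal sketch under stated assumptions. It uses the standard Levinson-Durbin recursion to obtain reflection coefficients from autocorrelation lags, and ordinary dynamic time warping as a stand-in for "nonlinear adjustment of the time axis". The function names (`autocorrelation`, `parcor`, `dtw_distance`, `recognize`), the Euclidean local cost, and the use of floating point (rather than the fixed-point arithmetic discussed above) are all assumptions made for illustration. Sign conventions for these coefficients also vary between texts.

```python
import math

def autocorrelation(frame, order):
    """Autocorrelation lags r[0..order] of one analysis frame."""
    n = len(frame)
    return [sum(frame[t] * frame[t + k] for t in range(n - k))
            for k in range(order + 1)]

def parcor(r):
    """Reflection coefficients from lags r via Levinson-Durbin.

    Assumed convention: predictor A(z) = 1 + a1*z^-1 + ...; other
    texts flip the sign of the coefficients.
    """
    order = len(r) - 1
    a = [1.0] + [0.0] * order      # current prediction polynomial
    err = r[0]                     # current prediction error energy
    ks = []
    for i in range(1, order + 1):
        if err <= 0.0:             # degenerate frame: stop refining
            ks.append(0.0)
            continue
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err
        ks.append(k)
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return ks

def dtw_distance(ref, test):
    """Accumulated cost of the best nonlinear time alignment of two
    sequences of feature vectors (Euclidean local cost assumed)."""
    inf = float("inf")
    n, m = len(ref), len(test)
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = math.dist(ref[i - 1], test[j - 1])
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]

def recognize(templates, utterance):
    """Return the label of the template closest to the utterance."""
    return min(templates, key=lambda label: dtw_distance(templates[label], utterance))
```

In this sketch each utterance would be a list of per-frame coefficient vectors, for example `[parcor(autocorrelation(f, 8)) for f in frames]`, and `templates` would map each digit label to one such reference sequence; the order 8 is an arbitrary illustrative choice, not a value taken from the paper.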

Published in:

Audio and Electroacoustics, IEEE Transactions on (Volume: 21, Issue: 3)