Usage of frame dropping and frame attenuation algorithms in automatic speech recognition systems

4 Author(s)
Vlaj, D. ; Electr. Eng. & Comput. Sci. Fac., Maribor Univ., Slovenia ; Kotnik, B. ; Kacic, Z. ; Horvat, B.

This paper presents the use of frame dropping and frame attenuation algorithms in automatic speech recognition systems. On the one hand, frame dropping is important because the speech recognition system does not need to deal with the noise-only parts of the input signal; on the other hand, the recognition results can be better if the spectral magnitudes of noise-only frames are attenuated. A novel voice activity detection (VAD) approach is proposed, based on the log filter-bank magnitudes and the so-called "hangover" criterion, to provide the decisions needed for frame dropping or frame attenuation. All tests were carried out on the Slovenian, German, and Spanish fixed-telephone SpeechDat II databases with the HTK speech recognition toolkit. The results show that a low word error rate can be achieved with a small number of Gaussian mixtures if either the frame dropping or the frame attenuation algorithm is used.
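The abstract does not give the VAD decision rule, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each frame is a list of log filter-bank magnitudes, that a frame counts as speech when the mean of those values exceeds a fixed threshold, and that the "hangover" criterion keeps a fixed number of frames marked as speech after the last detected speech frame. The function names, `threshold`, `hangover`, and `log_attenuation` are all hypothetical.

```python
from typing import List, Sequence


def frame_score(log_fbank: Sequence[float]) -> float:
    """Mean log filter-bank magnitude of one frame (assumed VAD feature)."""
    return sum(log_fbank) / len(log_fbank)


def vad_with_hangover(
    frames: Sequence[Sequence[float]], threshold: float, hangover: int
) -> List[bool]:
    """Return one speech/non-speech flag per frame.

    A frame above the threshold is speech. After such a frame, the next
    `hangover` frames are also kept as speech, even if they fall below
    the threshold (assumed form of the hangover criterion).
    """
    flags: List[bool] = []
    remaining = 0  # hangover frames still available
    for frame in frames:
        if frame_score(frame) > threshold:
            flags.append(True)
            remaining = hangover
        elif remaining > 0:
            flags.append(True)
            remaining -= 1
        else:
            flags.append(False)
    return flags


def drop_frames(
    frames: Sequence[Sequence[float]], flags: Sequence[bool]
) -> List[List[float]]:
    """Frame dropping: keep only the frames flagged as speech."""
    return [list(f) for f, keep in zip(frames, flags) if keep]


def attenuate_frames(
    frames: Sequence[Sequence[float]],
    flags: Sequence[bool],
    log_attenuation: float,
) -> List[List[float]]:
    """Frame attenuation: lower the log magnitudes of non-speech frames.

    Subtracting a constant in the log domain corresponds to scaling the
    spectral magnitudes by a constant factor (assumed attenuation scheme).
    """
    return [
        list(f) if keep else [v - log_attenuation for v in f]
        for f, keep in zip(frames, flags)
    ]


if __name__ == "__main__":
    frames = [[1.0, 1.0], [5.0, 7.0], [1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]
    flags = vad_with_hangover(frames, threshold=3.0, hangover=1)
    print(flags)
    print(drop_frames(frames, flags))
    print(attenuate_frames(frames, flags, log_attenuation=2.0))
```

Frame dropping shortens the feature sequence passed to the recognizer, whereas frame attenuation keeps the sequence length unchanged and only lowers the level of the frames judged to be noise-only.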

Published in:

EUROCON 2003. Computer as a Tool. The IEEE Region 8 (Volume: 2)

Date of Conference:

22-24 Sept. 2003