By Topic

A high-quality digital filterbank for speech recognition which runs in real time on a standard microprocessor

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
H. Silverman ; Brown University, Providence, RI, USA

Asynchronous digital filterbank or LPC output is widely used for high-quality research and commercial speech recognition applications. Special-purpose hardware is usually applied. In this paper, a decimation/DFT filterbank system is described which may be implemented in real time on a standard microprocessor. Key features of the approach are a dithering technique, which allows output sampling at a rate lower than would be theoretically predicted as adequate, and the Winograd Fourier transform algorithm (WFTA) implementation of the DFT. The contributions of this paper are the dithering idea, the demonstration that it is feasible to implement a high-quality digital filter-bank on a standard microprocessor, and the discussion of starting-phase error. The nonlinear effects of dithering are discussed in detail, and it is shown that the dithering technique significantly reduces starting phase error, which can be important in very close discrete utterance recognition (DUR) situations.

Published in:

IEEE Transactions on Acoustics, Speech, and Signal Processing  (Volume:34 ,  Issue: 5 )