By Topic

Particle Filter Enhancement of Speech Spectral Amplitudes

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Laska, B.N.M. ; Res. in Motion, Ltd., Waterloo, ON, Canada ; Bolić, M. ; Goubran, R.A.

This paper presents a particle filter approach to spectral amplitude speech enhancement. Spectral amplitudes are known to exhibit inter-frame dependencies and non-Gaussian statistics; however, incorporating these properties makes closed-form solutions intractable. Using the particle filter framework allows the presented algorithm to model the speech spectral amplitudes as an autoregressive process with Laplace distributed excitation. Two variants of the standard algorithm are also presented: one that uses an interacting multiple model approach to account for transitions between active speech and silence intervals, and one that allows for phase differences between the clean speech and noise complex Fourier transform coefficients. All of the particle sampling distributions are constrained to take the measurement into account, improving sampling efficiency. In experiments using wideband speech and real recorded noise the proposed algorithm variants are shown to offer natural-sounding output speech, with objective evaluation results that compare favorably to existing particle filter speech enhancement algorithms. The multiple model variant is found to improve inter-speech noise reduction, while the phase variant improves performance when the signal-to-noise ratio is low.

Published in:

Audio, Speech, and Language Processing, IEEE Transactions on  (Volume:18 ,  Issue: 8 )