By Topic

Speech enhancement based on auto gain control

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Nagata, Y. ; Dept. of Comput. & Inf. Sci., Iwate Univ., Morioka, Japan ; Fujioka, T. ; Abe, M.

We propose a new method of speech enhancement based on auto gain control (AGC) using two channel inputs to deal with transient noises. Auto gain control is considered to be relatively ineffective for reducing noises that are superimposed on speech. Nevertheless, it offers advantages for addressing problems posed by musical noise and spectral distortion. This method combines two operations for obtaining accurate gain. One is spectral subtraction for two-channel input (2chSS); the other is self-offset of the noise with pre-whitening. This study also addresses a coherence based post-filter to reduce uncorrelated noise components among channels. The proposed method is evaluated in experiments across three noise conditions in which (i) impulsive noises, (ii) stationary car noise, and (iii) speech noise are present, respectively. Objective measures and spectrograms demonstrate marked improvements over other two-microphone based methods, but subjective preference tests reveal that the proposed method is less preferred than the equivalent of a nonprocessed signal in the case of stationary car noise (ii). The performance of the proposed method and the conventional 2chSS were even in the case of speech noise (iii). These results of subjective tests reflect some disadvantages of the AGC processing. Those drawbacks involve degradation of noise consistency in stationary noise conditions and residual noises in desired speech segments. Nevertheless, subjective tests in the case of noise (i) demonstrate that the proposed method is the most preferred among the methods compared here. The effectiveness of the proposed method is confirmed particularly for this noise condition.

Published in:

Audio, Speech, and Language Processing, IEEE Transactions on  (Volume:14 ,  Issue: 1 )