By Topic

Improved voice activity detection via contextual information and noise suppression

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
A. Sangwan ; Dept. of Electr. & Electron. Eng., Concordia Univ., Montreal, Que., Canada ; W. P. Zhu ; M. O. Ahmad

In this paper, we develop a contextual voice activity detection (VAD) scheme which combines both contextual and frame specific information to improve detection. Unlike many VAD algorithms which assume that the cues to activity lie within the frame alone, our scheme seeks information for activity in the current as well as the neighboring frames. The new approach provides good robustness in low SNR when the speech frame is corrupted and an alternate reliable source of activity information is necessary. Further, we present a simple noise suppression scheme to enhance the VAD performance at low SNR. The noise suppressor provides spectrally reshaped signal to the VAD. Finally, we combine the contextual VAD and the noise suppression scheme with a basic detector to form a comprehensive VAD. The proposed comprehensive VAD system is tested on speech samples from the SWITCHBOARD database. Various noises under different SNRs are added to the speech signals. Experimental results show that the proposed VAD outperforms the standard algorithm ETSI AMR VAD-1.

Published in:

2005 IEEE International Symposium on Circuits and Systems

Date of Conference:

23-26 May 2005