By Topic

Robust voice activity detection algorithm based on the perceptual wavelet packet transform

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

5 Author(s)
Shi-Huang Chen ; Dept. of Comput. Sci. & Inf. Eng., Shu-Te Univ., Kaohsiung, Taiwan ; Hsin-Te Wu ; Chia-Hsiang Chen ; Jiun-Ching Ruan
more authors

In this paper, a robust voice activity detection (VAD) algorithm based on the perceptual wavelet packet transform (PWPT) is proposed. The first step of this new VAD algorithm is to make use of the PWPT to decompose the input speech into 17 critical subband signals. To enhance energy of voice frames and decay energy of unvoice frames, the voice activity shape (VAS) is derived from the Teager energy operator (TEO) of these critical subband signals. Then the adaptive weighted threshold (AWT) value can be calculated from the second derivative recursive mean (SDRM) of the VAS and environments noise estimation. It is shown in this paper that the AWT is a robust threshold value for VAD under various noisy environments. One of advantages of this new algorithm is that the preset threshold values are not necessary. In addition, the proposed algorithm can adapt VAD threshold value to variable speech conditions. Experimental results show that the new VAD algorithm outperforms the G.729B and adaptive multi rate (AMR) VAD.

Published in:

Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings of 2005 International Symposium on

Date of Conference:

13-16 Dec. 2005