By Topic

Single-channel speech separation by using a sparse decomposition with periodic structure

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Nakashizuka, M. ; Grad. Sch. of Eng. Sci., Osaka Univ., Toyonaka ; Okumura, H. ; Iiguni, Y.

In this paper, we propose single-channel speech separation by using a sparse decomposition method. First, the model for the periodic signals with time-varying amplitude is introduced to approximate speech signals. The sparse decomposition is proposed with this signal model and a sparsity measure. The sparsity measure is defined as a sum of the l2 norms of the resultant periodic subsignals to find the shortest path to the approximation. By this penalty of the sparsity, the proposed decomposition extracts significant periodic components from a mixture and has ability of the source estimation for mixtures of periodic signals. In experiments, we apply the proposed decomposition to speech mixtures and demonstrate speech separation with codebooks of the speakers. In additionally, comparison with MaxVQ method that performs separation on the frequency spectrum domain is also demonstrated. Comparing with the MaxVQ, our method is less sensitive to the codebook design and requires less computational costs.

Published in:

Intelligent Signal Processing and Communications Systems, 2008. ISPACS 2008. International Symposium on

Date of Conference:

8-11 Feb. 2009