Conferences >2020 Asia-Pacific Signal and ...

Multi-channel Speech Separation Using Deep Embedding With Multilayer Bootstrap Networks

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Recently, deep clustering (DPCL) based speaker-independent speech separation has drawn much attention, since it needs little speaker prior information. However, it still ...Show More

Metadata

Abstract:

Recently, deep clustering (DPCL) based speaker-independent speech separation has drawn much attention, since it needs little speaker prior information. However, it still has much room of improvement, particularly in reverberant environments. If the training and test environments mismatch which is a common case, the embedding vectors produced by DPCL may contain much noise and many small variations. To deal with the problem, we propose a variant of DPCL, named MDPCL, by applying a recent unsupervised deep learning method-multilayer bootstrap networks (MBN)-to further reduce the noise and small variations of the embedding vectors in an unsupervised way in the test stage, which fascinates k-means to produce a good result. MBN builds a gradually narrowed network from bottom-up via a stack of k-centroids clustering ensembles, where the k-centroids clusterings are trained independently by random sampling and one-nearest-neighbor optimization. To further improve the robustness of MDPCL in reverberant environments, we take spatial features as part of its input. Experimental results demonstrate the effectiveness of the proposed method.

Published in: 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

Date of Conference: 07-10 December 2020

Date Added to IEEE Xplore: 31 December 2020

ISBN Information:

ISSN Information:

Conference Location: Auckland, New Zealand

Contents

References is not available for this document.

Multi-channel Speech Separation Using Deep Embedding With Multilayer Bootstrap Networks

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Multi-channel Speech Separation Using Deep Embedding With Multilayer Bootstrap Networks

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Figures

References

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?