Abstract:
Various ideal masks have been used as the training targets for supervised speech separation. While different choices often lead to different results, the reason remains u...Show MoreMetadata
Abstract:
Various ideal masks have been used as the training targets for supervised speech separation. While different choices often lead to different results, the reason remains unclear. In this paper, an oracle method is applied to investigate the properties of the ideal masks including the ideal binary mask (IBM), the ideal ratio mask (IRM), the phase sensitive mask (PSM) and the complex ideal ratio maks (cIRM). They are evaluated in terms of intelligibility and quality under a general speech separation scenario. The relative importance of phase is also taken into consideration. Moreover, we introduce a novel ideal gain mask (IGM) which performs generally better than IRM and one alternative of the cIRM. It is shown that the masks are not equally designed and the results provide a strong reference to the potential performance of supervised algorithms.
Date of Conference: 13-16 September 2016
Date Added to IEEE Xplore: 24 October 2016
ISBN Information: