Combining HMM-based melody extraction and NMF-based soft masking for separating voice and accompaniment from monaural audio | IEEE Conference Publication | IEEE Xplore

Combining HMM-based melody extraction and NMF-based soft masking for separating voice and accompaniment from monaural audio


Abstract:

Modern monaural voice and accompaniment separation systems usually consist of two main modules: melody extraction and time frequency masking. A main distinction between d...Show More

Abstract:

Modern monaural voice and accompaniment separation systems usually consist of two main modules: melody extraction and time frequency masking. A main distinction between different separation systems lies in what approaches are used for the two modules. Popular techniques for melody extraction include hidden Markov models (HMMs) and non-negative matrix factorization (NMF), and masking includes hard and soft masking. This paper investigates the flaw of NMF-based melody extraction, and proposes the combination of HMM-based melody extraction (equipped with a newly-defined feature) and NMF-based soft masking. Evaluations on two publicly available databases show that the proposed system reaches state-of the-art performance and outperforms several other combinations.
Date of Conference: 22-27 May 2011
Date Added to IEEE Xplore: 11 July 2011
ISBN Information:

ISSN Information:

Conference Location: Prague, Czech Republic

Contact IEEE to Subscribe

References

References is not available for this document.