Extraction of mono-aural vocal and non-vocal components exploiting ‘ANOVA’ computational method in REPET | IEEE Conference Publication | IEEE Xplore

Extraction of mono-aural vocal and non-vocal components exploiting ‘ANOVA’ computational method in REPET


Abstract:

The distinction of the lead varying vocals from the background music in an audio recording is an extremely demanding and exigent task. The speech-separation research usua...Show More

Abstract:

The distinction of the lead varying vocals from the background music in an audio recording is an extremely demanding and exigent task. The speech-separation research usually inculcates Time-frequency masking technique that ultimately appraises the hearing-aid design. The core principle in music which is capitalized to discriminate underlying non-vocals from vocals (speech) is Repetition. The `Repetition' feature is especially enacted for pop songs where the singer often overlays frequently changing vocals on a periodically repeating background in a mixture. The basic approach of this research paper is the recognisation of periodically repeating segments in audio excerpts, analogize them with a repeating model and finally discrimate the repeating musical patterns via Time-Frequency masking. A TF mask is grounded on the basis of TF representation of any signal. In the proposed algorithm, the quality of foreground vocals and accompanying background can be adjudicated in terms of SIR (Signal to Interference Ratio) value utilizing `ANOVA' (Analysis Of Variation) computational method on six different genres of musical audios.
Date of Conference: 10-12 September 2015
Date Added to IEEE Xplore: 11 January 2016
ISBN Information:
Conference Location: Indore, India

Contact IEEE to Subscribe

References

References is not available for this document.