FC-U2-Net: A Novel Deep Neural Network for Singing Voice Separation | IEEE Journals & Magazine | IEEE Xplore

FC-U2-Net: A Novel Deep Neural Network for Singing Voice Separation


Abstract:

Singing voice separation, which aims to separate vocals and accompaniment from mixed musical signal, has been a popular topic. In this work, we propose a novel deep neura...Show More

Abstract:

Singing voice separation, which aims to separate vocals and accompaniment from mixed musical signal, has been a popular topic. In this work, we propose a novel deep neural network called FC-U2-Net for singing voice separation. The network is a two-level nested U-structure, in which the time-invariant fully-connected layers are added along the frequency axis. This structure enables it to capture not only the local and global contextual information, but also the long-range correlations of voice signal along the frequency axis. In addition, a novel loss function combining ratio mask and binary mask is proposed. This strategy makes the estimated vocals signal cleaner and carries less accompaniment signals. The experimental results show that our method surpasses four state-of-the-art methods on the MUSDB18 singing voice separation task, and the source-to-distortion ratio (SDR) reaches to the optimal 7.53 dB.
Page(s): 489 - 494
Date of Publication: 05 January 2022

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.