Guiding audio source separation by video object information


Abstract:

In this work, we propose novel joint and sequential multimodal approaches for the task of single-channel audio source separation in videos. This is done within the popular non-negative matrix factorization (NMF) framework using information about the sounding object's motion. Specifically, we present methods that use a non-negative least squares formulation to couple motion and audio information. The proposed techniques generalize recent work on NMF-based motion-informed source separation and extend easily to video data. Experiments with two distinct multimodal datasets of string instrument performance recordings illustrate their advantages over existing methods.
Date of Conference: 15-18 October 2017
Date Added to IEEE Xplore: 11 December 2017
ISBN Information:
Electronic ISSN: 1947-1629
Conference Location: New Paltz, NY, USA
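
The abstract gives no equations, so the following is only a rough sketch of what a sequential, motion-informed NMF pipeline could look like, not the authors' algorithm. It factorizes a magnitude spectrogram with standard multiplicative-update NMF, then uses non-negative least squares to relate each activation row to per-source motion features and groups components by the largest coefficient. Everything here is an assumption made for illustration: the synthetic data, the function names (nmf, assign_components, separate), the Euclidean cost, the argmax grouping rule, and the soft-mask reconstruction.

```python
import numpy as np
from scipy.optimize import nnls


def nmf(V, K, n_iter=200, eps=1e-9, seed=0):
    """Euclidean NMF, V (F x N) ~= W (F x K) @ H (K x N), multiplicative updates."""
    rng = np.random.default_rng(seed)
    F, N = V.shape
    W = rng.random((F, K)) + eps
    H = rng.random((K, N)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def assign_components(H, M):
    """Assumed grouping rule: regress each activation row onto the motion
    features M (N x S, non-negative) with NNLS and pick the source whose
    coefficient is largest."""
    coeffs = np.array([nnls(M, h)[0] for h in H])  # shape (K, S)
    return coeffs.argmax(axis=1)


def separate(V, W, H, labels, n_sources, eps=1e-9):
    """Soft-mask reconstruction of each source from its assigned components."""
    full = W @ H + eps
    return [V * ((W[:, labels == s] @ H[labels == s]) / full)
            for s in range(n_sources)]


# Synthetic stand-in data (hypothetical): two sources whose activations
# follow two motion-feature tracks, e.g. per-frame motion magnitudes.
rng = np.random.default_rng(0)
F, N, S, K = 20, 50, 2, 4
M = rng.random((N, S))          # motion features, one column per source
W_true = rng.random((F, S))     # one spectral template per source
V = W_true @ M.T                # mixture magnitude spectrogram

W, H = nmf(V, K)
labels = assign_components(H, M)
estimates = separate(V, W, H, labels, S)
```

A joint approach would instead fold the motion-coupling term into the factorization objective itself; the sequential form is shown only because it is the simpler of the two to sketch.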
