Loading [MathJax]/extensions/MathMenu.js
Audio-visual face detection for tracking in a meeting room environment | IEEE Conference Publication | IEEE Xplore

Audio-visual face detection for tracking in a meeting room environment


Abstract:

A key task in many applications such as tracking or face recognition is the detection and localisation of a subject's face in an image. This can still prove to be a chall...Show More

Abstract:

A key task in many applications such as tracking or face recognition is the detection and localisation of a subject's face in an image. This can still prove to be a challenging task particularly in low resolution or noisy images. Here we propose a robust method for face detection using both audio and visual information. We construct a dictionary learning based face detector using a set of distinctive and robust image features. We then train a support vector machine classifier using sparse image representations produced by this dictionary to classify face versus background. This is combined with the azimuth angle of the speaker produced by an audio localisation system to constrain the search space for the subject's face. This increases the efficiency of the detection and localisation process by limiting the search area. However, more importantly, the audio information allows us to know a priori the number of subjects in the image. This greatly reduces the possibility of false positive face detections. We demonstrate the advantage of this proposed approach over traditional face detection methods on the challenging AV16.3 dataset.
Date of Conference: 09-12 July 2013
Date Added to IEEE Xplore: 21 October 2013
ISBN Information:
Conference Location: Istanbul, Turkey

References

References is not available for this document.